A DNA construct encoding an enzyme exhibiting amylolytic activity, which DNA construct a) comprises a DNA sequence encoding at least
one of the partial amino acid sequences (I), and/or b) comprises a DNA sequence hybridizing with an oligonucleotide probe prepared on
the basis of any of the DNA sequences shown in SEQ ID Nos. 1-6, on the basis of the amino acid sequence encoded by any of the said DNA
sequences or the amino acid sequence shown in SEQ ID No. 9, or on the basis of any of the partial amino acid sequences (a)-(R) listed
in a), and/or c) encodes a polypeptide being at least 70% homologous with the amino acid sequence shown in SEQ ID No. 9, as well as an
amylolytic enzyme encoded by the DNA construct. The amylolytic enzyme is preferably of archaebacterial origin, such as derivable from
a strain of Pyrococcus sp., e.g. P. furiosus, and may, e.g., be used for starch liquefaction.
AN AMYLOLYTIC ENZYME

FIELD OF THE INVENTION 

The present invention relates to a DNA construct comprising a
DNA sequence encoding an amylolytic enzyme, in particular a
Pyrococcus α-amylase or a variant thereof, and to a vector and a
cell harbouring the DNA construct. Furthermore, the invention
relates to a process for producing the amylolytic enzyme by use
of recombinant DNA techniques.

BACKGROUND OF THE INVENTION 

During the last decade enzymes produced by thermophilic
microorganisms such as archaebacteria have been the subject of
increasing interest, mainly because of their high thermostability
which, for a number of industrial applications, is desirable.

Examples of such hyperthermophilic enzymes are those produced
by strains of the thermophilic archaebacterium Pyrococcus. For
instance, WO 90/11352 discloses novel Pyrococcus α-amylases
obtained from strains of the species P. woesei and P. furiosus
by conventional fermentation procedures involving culturing of
the Pyrococcus strains and isolating α-amylase preparations
therefrom. Furthermore, Koch et al. (1990) and Brown et al.
(1990) describe partially purified P. furiosus α-amylase. WO
92/02614 discloses novel thermostable pullulanases obtained
from Pyrococcus spp.

The Pyrococcus α-amylases and pullulanases disclosed in WO
90/11352 and WO 92/02614, respectively, have been found to
possess an extremely high thermostability as compared to other
known α-amylases and pullulanases, and are stated to be useful
in high-temperature processes involving α-amylase or
pullulanase activity.
It would be desirable to facilitate the production of such
amylolytic enzymes, and of amylolytic enzymes in general, both
with a view to improving their purity and with a view to
providing larger amounts of the purified enzyme at lower cost
than is possible by cultivation of a parent strain capable of
expressing the enzyme.

BRIEF DISCLOSURE OF THE INVENTION 

The present inventors have now succeeded in cloning a DNA
sequence encoding an α-amylase from a strain of the
archaebacterial species Pyrococcus furiosus and obtaining
α-amylase expression from a host cell harbouring said DNA
sequence. The present invention is based on this finding.

More specifically, in a first aspect the invention relates to
a DNA construct comprising a DNA sequence encoding a Pyrococcus
α-amylase or a variant thereof having α-amylase activity and/or
being immunologically cross-reactive with a Pyrococcus
α-amylase, which DNA sequence

i) comprises a partial DNA sequence as shown in SEQ ID Nos. 2,
3, 4, 5 and/or 6 or an analogue of said partial sequence
capable of hybridizing with an oligonucleotide probe prepared
on the basis of the DNA sequence shown in SEQ ID No. 2, 3, 4,
5 and/or 6,

ii) corresponds to a genomic Pyrococcus DNA sequence located
within 5 kb of a genomic DNA sequence hybridizing with an
oligonucleotide probe prepared on the basis of the DNA sequence
shown in SEQ ID No. 2, 3, 4, 5 and/or 6, and/or

iii) comprises the DNA sequence shown in SEQ ID No. 1 or an
analogue of said sequence capable of hybridizing with an
oligonucleotide probe prepared on the basis of the DNA sequence
shown in SEQ ID No. 1.

The DNA sequence shown in SEQ ID No. 1 comprises an open
reading frame encoding an α-amylase. The partial DNA sequences
SEQ ID Nos. 2-6 constitute either part of or flank this open
reading frame, as will be further explained below.

The term "corresponds" as used about the DNA sequence with
property ii) of the DNA construct of the invention is intended
to indicate that the DNA sequence may be of any origin,
including a genomic, cDNA and/or synthetic origin, as will be
explained in further detail below.

In a further aspect the present invention relates to a DNA 
construct encoding an enzyme exhibiting amylolytic activity, 
which DNA construct 

a) comprises a DNA sequence encoding at least one of the
partial amino acid sequences

(a) AKYLELEEGG (SEQ ID NO 10);  (b) VIMQAFYWDV (SEQ ID NO 11);
(c) PGGGIWWDHI (SEQ ID NO 12);  (d) RSKIPEWYEA (SEQ ID NO 13);
(e) GISAIWLPPP (SEQ ID NO 14);  (f) SKGMSGGYSM (SEQ ID NO 15);
(g) GYDPYDYFDL (SEQ ID NO 16);  (h) GEYYQKGTVE (SEQ ID NO 17);
(i) TRFGSKEELV (SEQ ID NO 18);  (j) RLIQTAHAYG (SEQ ID NO 19);
(k) IKVIADWIN (SEQ ID NO 20);   (l) HRAGGDLEWN (SEQ ID NO 21);
(m) PFVGDYTWTD (SEQ ID NO 22);  (n) FSKVASGKYT (SEQ ID NO 23);
(o) ANYLDFHPNE (SEQ ID NO 24);  (p) LHCCDEGTFG (SEQ ID NO 25);
(q) GFPDICHHKE (SEQ ID NO 26);  (r) WDQYWLWKSN (SEQ ID NO 27);
(s) ESYAAYLRSI (SEQ ID NO 28);  (t) GFDGWRFDYV (SEQ ID NO 29);
(u) KGYGAWWRD (SEQ ID NO 30);   (v) WLNWWGGWAV (SEQ ID NO 31);
(x) GEYWDTNVDA (SEQ ID NO 32);  (y) LLSWAYESGA (SEQ ID NO 33);
(z) KVFDFPLYYK (SEQ ID NO 34);  (A) MDEAFDNNNI (SEQ ID NO 35);
(B) PALVYALQNG (SEQ ID NO 36);  (C) QTWSRDPFK (SEQ ID NO 37);
(D) AVTFVANHDT (SEQ ID NO 38);  (E) DIIWNKYPAY (SEQ ID NO 39);
(F) AFILTYEGQP (SEQ ID NO 40);  (G) VIFYRDFEEW (SEQ ID NO 41);
(H) LNKDKLINLI (SEQ ID NO 42);  (I) WIHDHLAGGS (SEQ ID NO 43);
(J) TTIVYYDNDE (SEQ ID NO 44);  (K) LIFVRNGDSR (SEQ ID NO 45);
(L) RPGLITYINL (SEQ ID NO 46);  (M) SPNWVGRWVY (SEQ ID NO 47);
(N) VPKFAGACIH (SEQ ID NO 48);  (O) EYTGNLGGWV (SEQ ID NO 49);
(P) DKRVDSSGWV (SEQ ID NO 50);  (Q) YLEAPPHDPA (SEQ ID NO 51);
(R) NGYYGYSVWSYCGVG (SEQ ID NO 52), and/or

b) comprises a DNA sequence hybridizing with an oligonucleotide
probe prepared on the basis of any of the DNA sequences shown in
SEQ ID Nos. 1-6, on the basis of the amino acid sequence
encoded by any of the said DNA sequences or the amino acid
sequence shown in SEQ ID No. 9, or on the basis of any of the
partial amino acid sequences (a)-(R) listed in a) above, and/or

c) encodes a polypeptide being at least 70% homologous with the
amino acid sequence shown in SEQ ID No. 9.

In a further aspect the invention relates to an amylolytic
enzyme encoded by this DNA construct.

In the present context, the term "amylolytic activity" is
intended to indicate that the enzyme in question has a
starch-degrading capability. Specific examples of enzymes having
amylolytic activity, i.e. amylolytic enzymes, include
α-amylases, pullulanases, neo-pullulanases, iso-amylases,
beta-amylases, CGTases, maltogenases as well as G-4 and G-6
amylases.

It is generally known that besides the above-mentioned common
functional feature, amylolytic enzymes have structural features
in common. Thus, it has been found that the secondary structure
of amylolytic enzymes comprises regions of high homology, in
other words amino acid regions that are highly conserved between
the different types of amylolytic enzymes, vide, e.g.,
Podkovyrov (1992), Zhou (1989) and Svensson (1988).

The partial amino acid sequences shown above, which constitute
part of the Pyrococcus α-amylase having the amino acid sequence
shown in SEQ ID No. 9 and which have been found to be novel as
such, may therefore be found to be characteristic not only for
the Pyrococcus α-amylase disclosed herein, but also for other
amylolytic enzymes. These partial sequences are contemplated to
constitute important tools in the identification and isolation
of novel amylolytic enzymes.
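
By way of illustration only, the use of the partial amino acid
sequences as identification tools may be sketched as a simple
exact-match scan of a candidate protein sequence. The function
name, the choice of three motifs and the query sequence are
assumptions made for this sketch and do not form part of the
disclosure; a practical screen would presumably also tolerate
mismatches.

```python
# Partial amino acid sequences copied from the list in a) above
# (a small subset, chosen arbitrarily for this sketch).
PARTIAL_SEQUENCES = {
    "SEQ ID NO 10": "AKYLELEEGG",
    "SEQ ID NO 11": "VIMQAFYWDV",
    "SEQ ID NO 16": "GYDPYDYFDL",
}


def find_partial_sequences(protein, motifs=PARTIAL_SEQUENCES):
    """Return {motif name: [0-based start positions]} for exact matches."""
    protein = protein.upper()
    hits = {}
    for name, motif in motifs.items():
        positions = []
        start = protein.find(motif)
        while start != -1:
            positions.append(start)
            start = protein.find(motif, start + 1)
        if positions:
            hits[name] = positions
    return hits


if __name__ == "__main__":
    # Hypothetical query sequence, made up for illustration only.
    query = "MKK" + "AKYLELEEGG" + "TTT" + "GYDPYDYFDL"
    print(find_partial_sequences(query))
    # {'SEQ ID NO 10': [3], 'SEQ ID NO 16': [16]}
```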

In a still further aspect the present invention relates to a
vector harbouring the DNA construct of the invention, and to a
cell which harbours either the DNA construct or the vector of
the invention.

In the course of the research leading to the present invention
it was surprisingly found to be possible to obtain α-amylase
expression from a strain of Bacillus transformed with a
Pyrococcus DNA sequence without any modification of said DNA
sequence being required. Such modification, which e.g. involves
modification or replacement of the ribosome binding site of the
gene to be translated, is normally considered to be a
prerequisite for obtaining an efficient translation and thereby
expression from non-gram-positive DNA sequences in Bacillus, cf.
also the explanation given in Example 6 hereinafter. As far as
the present inventors are aware there has been no prior
disclosure of the expression of Pyrococcus α-amylase genes in
Bacillus. WO 93/10248, which was published only after the
priority date of the present application, discloses the use of
Bacillus licheniformis as a host cell for the recombinant
production of certain proteins including Pyrococcus α-amylase.

Based on the above described finding it is contemplated that
direct expression of Pyrococcus α-amylase genes, in general,
may be obtained in a strain of Bacillus and accordingly, in a
particular aspect the invention relates to a recombinant
Bacillus cell, which is different from a cell of Bacillus
licheniformis, and which harbours a DNA construct comprising a
DNA sequence encoding a Pyrococcus α-amylase or a variant
thereof, the DNA construct optionally being present on an
expression vector.

In a further aspect the present invention relates to a process
for the production of a recombinant amylolytic enzyme, in
particular a recombinant Pyrococcus α-amylase or variant
thereof, comprising culturing a cell as described above in a
suitable culture medium under conditions permitting expression
of the amylolytic enzyme, and recovering the resulting
amylolytic enzyme from the culture.

WO 94/19454, PCT/DK94/00071

By use of the process of the invention it is contemplated that
an amylolytic enzyme, such as a Pyrococcus α-amylase or a
variant thereof, encoded by the DNA construct of the invention,
may be produced in high amounts and in a high purity, thereby,
for instance, making it possible to optimize the production of
the amylolytic enzyme in question in mono-component form
essentially free from any other enzymatic activities.

The present invention also relates to a recombinant Pyrococcus
α-amylase comprising the amino acid sequence shown in SEQ ID
No. 9 or a variant thereof having α-amylase activity, being
immunologically cross-reactive with the polypeptide comprising
the amino acid sequence shown in SEQ ID No. 9 and/or being at
least 70% homologous with the amino acid sequence shown in SEQ
ID No. 9.

In the present context the term "variant" is intended to
include a polypeptide which comprises an amino acid sequence
differing from that of SEQ ID No. 9 by one or more amino acid
residues. The variant may be prepared by suitably modifying a
DNA sequence encoding the α-amylase, preferably a DNA sequence
present in a DNA construct of the invention, resulting in the
addition of one or more amino acid residues to either or both
of the N- and C-terminal ends of the α-amylase, substitution of
one or more amino acid residues at one or more different sites
in the amino acid sequence, deletion of one or more amino acid
residues at either or both ends of the α-amylase or at one or
more sites in the amino acid sequence, or insertion of one or
more amino acid residues at one or more sites in the amino acid
sequence. The modification of the DNA sequence may be performed
by site-directed or by random mutagenesis or a combination of
these techniques in accordance with well-known procedures.
Subsequent to the modification, the gene product of the modified
DNA is expressed and tested for α-amylase activity,
immunological cross-reactivity with a purified Pyrococcus
α-amylase, or homology with the amino acid sequence shown in
SEQ ID No. 9. It will be understood that the term "variant" is
intended to include a subsequence of a Pyrococcus α-amylase
having α-amylase activity and/or being reactive with an
antibody raised against a Pyrococcus α-amylase.

It is contemplated that the Pyrococcus α-amylase encoded by the
DNA construct of the invention is particularly suitable for use
in starch liquefaction carried out at high temperatures.
Accordingly, in a still further aspect the invention relates to
a starch liquefaction process which comprises subjecting an
aqueous starch slurry to enzymatic liquefaction in the presence
of a Pyrococcus α-amylase or a variant of the invention.

In a final aspect the invention relates to the use of an
amylolytic enzyme of the invention for starch modification,
including starch liquefaction and/or saccharification,
debranching of starch, production of various syrups, e.g.
maltose syrup, production of cyclodextrin, and production of
oligosaccharides.



DETAILED DESCRIPTION OF THE INVENTION

The partial DNA sequences apparent from SEQ ID Nos. 2, 3, 4, 5
and 6 were derived from genomic clones prepared from a strain
of the archaebacterial species Pyrococcus furiosus harbouring
a DNA sequence of about 5 kb which encodes an α-amylase
activity, cf. the examples hereinafter. The partial DNA
sequences SEQ ID Nos. 3 and 6, which were identified in
different clones, are identical except that SEQ ID No. 6 is
longer than SEQ ID No. 3. Both sequences comprise the
N-terminal part of the coding sequence of the α-amylase
starting at position 7 (GTG). SEQ ID Nos. 2 and 4 constitute
internal parts of the coding sequence shown in SEQ ID No. 1
starting in the same XbaI site (TCTAGA). SEQ ID No. 2 is read
in the upstream direction, whereas SEQ ID No. 4 is read in the
downstream direction. SEQ ID No. 5 is located about 3.3 kb
downstream of the 3' end of the coding sequence and is read
upstream.

The partial DNA sequences (SEQ ID Nos. 2, 3, 4, 5 and 6) may
either separately or in combinations of two or more be used in
the isolation of DNA sequences encoding a Pyrococcus α-amylase,
cf. the examples herein. Thus, these DNA sequences have been
found to constitute important tools in the isolation of the DNA
sequence identified in SEQ ID No. 1 and thus in the preparation
of recombinant Pyrococcus α-amylase.

In the DNA construct of the invention encoding a Pyrococcus
α-amylase or a variant thereof, the analogue of the DNA sequence
shown in SEQ ID No. 1 or of any of the partial DNA sequences
shown in SEQ ID Nos. 2, 3, 4, 5 and 6 may, for instance, be a
subsequence of any of these DNA sequences, a genetically
engineered modification of said sequences which may be prepared
by well-known procedures, e.g. by site-directed mutagenesis, or
a DNA sequence encoding a Pyrococcus α-amylase or a variant
thereof isolated from a strain of a Pyrococcus sp. In any
event the analogous DNA sequence should hybridize to an
oligonucleotide probe which may be prepared on the basis of the
DNA sequence shown in SEQ ID No. 1 or of any of the partial DNA
sequences SEQ ID Nos. 2, 3, 4, 5 or 6, e.g. constituting a
subsequence or the entire sequence thereof. Alternatively, a
suitable oligonucleotide probe may be prepared on the basis of
an amino acid sequence encoded by a DNA construct of the
invention harbouring one or more of the DNA sequences SEQ ID
Nos. 1, 2, 3, 4, 5 and 6, e.g. on the basis of any part of the
amino acid sequences shown in SEQ ID Nos. 8 or 9.

On the basis of the amino acid sequence shown in SEQ ID No. 9
of a mature α-amylase enzyme, an analysis has been made of the
extent to which the DNA sequence shown in SEQ ID No. 1 may vary
without affecting its α-amylase encoding capability. As a
result of this analysis it has been found that the most
"different" DNA sequence having retained the capability of
encoding the polypeptide having the amino acid sequence shown in
SEQ ID No. 9 is one which shows a homology of about 62.6% with
the DNA sequence shown in SEQ ID No. 1.
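
An analysis of this kind may, for instance, be approached by
replacing every codon with the synonymous codon that shares the
fewest nucleotides with it and computing the resulting identity.
The following is a minimal sketch under the assumption that the
standard genetic code applies; the function names and the short
test sequence are hypothetical, and the sketch is not asserted to
be the method by which the figure stated above was obtained.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def synonymous_codons(codon):
    """All codons (including the input) encoding the same amino acid."""
    aa = CODON_TABLE[codon]
    return [c for c, a in CODON_TABLE.items() if a == aa]


def min_identity_recoding(dna):
    """Lowest nucleotide identity (%) of any sequence encoding the same protein."""
    dna = dna.upper()
    if len(dna) == 0 or len(dna) % 3:
        raise ValueError("sequence length must be a non-zero multiple of 3")
    shared = 0
    for i in range(0, len(dna), 3):
        codon = dna[i:i + 3]
        # Fewest positions this codon shares with any of its synonyms.
        shared += min(
            sum(x == y for x, y in zip(codon, alt))
            for alt in synonymous_codons(codon)
        )
    return 100.0 * shared / len(dna)


if __name__ == "__main__":
    # Hypothetical 4-codon sequence (Met-Trp-Leu-Ser), for illustration only.
    print(round(min_identity_recoding("ATGTGGCTGAGC"), 1))  # 58.3
```

Met and Trp have a single codon each and therefore always
contribute full identity, whereas a six-codon residue such as
Ser can be recoded with no nucleotide in common.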

Accordingly, the analogue of the DNA sequence shown in SEQ ID
No. 1 is preferably one being at least 62.5% homologous to the
said DNA sequence, more preferably at least 70% homologous,
such as at least 80% homologous, still more preferably at least
90% homologous with the DNA sequence shown in SEQ ID No. 1. In
the present context the homology is determined as the degree of
identity between the two sequences indicating a derivation of
the first sequence from the second. In a particular embodiment,
the analogue of the DNA sequence shown in SEQ ID No. 1 is a
synthetic gene.

The hybridization of a DNA sequence with the relevant
oligonucleotide probe(s) may be carried out under any suitable
conditions allowing the DNA sequences to hybridize. For instance,
such conditions are hybridization under specified conditions,
e.g. involving presoaking in 5xSSC and prehybridizing for 1 h at
~40 °C in a solution of 20% formamide, 5xDenhardt's solution,
50 mM sodium phosphate, pH 6.8, and 50 µg of denatured sonicated
calf thymus DNA, followed by hybridization in the same solution
supplemented with 100 µM ATP for 18 h at ~40 °C, or other methods
described by e.g. Sambrook et al., 1989.

The immunological cross-reactivity of a variant of a Pyrococcus
α-amylase encoded by the DNA construct of the invention may be
assayed using an antibody raised against or reactive with at
least one epitope of a Pyrococcus α-amylase, which may be of
recombinant or native origin, and preferably encoded by a DNA
construct of the invention. The antibody, which may either be
monoclonal or polyclonal, may be produced by methods known in
the art, e.g. as described by Hudson et al., 1989. The
immunological cross-reactivity may be determined using assays
known in the art, examples of which are Western Blotting or
radial immunodiffusion assay, e.g. as described by Hudson et
al., 1989.

It has been found that the partial DNA sequences identified in
SEQ ID Nos. 5 and 6 flank the α-amylase encoding part of the
genomic DNA sequence isolated from the P. furiosus strain
DSM 3638 as described in the examples hereinafter, the start
codon of the α-amylase encoding DNA constituting part of the
partial sequence SEQ ID No. 6. It is contemplated that
analogous DNA sequences as defined above similarly may be found
to flank other Pyrococcus α-amylase encoding sequences or
sequences encoding other amylolytic enzymes. Accordingly, in the
DNA construct of the invention, the DNA sequence encoding the
amylolytic enzyme, and in particular the Pyrococcus α-amylase
or a variant thereof, preferably corresponds to a genomic
Pyrococcus DNA fragment located between and optionally
comprising the partial DNA sequences identified in the appended
SEQ ID Nos. 5 and 6 or analogues thereof capable of hybridizing
with an oligonucleotide probe prepared on the basis of the DNA
sequence shown in SEQ ID No. 5 and 6, respectively.

Also DNA constructs harbouring a DNA sequence hybridizing with
a DNA sequence having any of the properties i)-iii) above are
considered to be within the invention. Such DNA constructs may,
but need not, comprise one or more of the sequences shown in
SEQ ID Nos. 1-6 or analogues thereof.

As stated above, the DNA sequences shown in SEQ ID Nos. 1-6
were determined from a genomic clone prepared from a strain of
Pyrococcus furiosus, more specifically from the P. furiosus
strain DSM 3638 available from the Deutsche Sammlung von
Mikroorganismen und Zellkulturen GmbH. Due to the high degree
of homology normally found within the group of amylolytic
enzymes it is contemplated that DNA sequences, which are
homologous to the DNA sequences shown in SEQ ID Nos. 1-6 and
which encode amylolytic enzymes other than the Pyrococcus
α-amylase disclosed herein, may be identified in other
organisms, including thermophilic organisms such as
archaebacteria and other types of organisms living under extreme
conditions.

Accordingly, the DNA sequence of a DNA construct of the
invention may be derived from an archaebacterium, in particular
a thermophilic archaebacterium such as a strain of the genus
Pyrococcus, especially from a strain of P. woesei or P.
furiosus. Such strains may be isolated by established
procedures from places expected to harbour archaebacteria, e.g.
hot springs or the like, or may be obtained from publicly
available culture collections.

An example of an analogous DNA sequence is a DNA sequence
derivable from the P. woesei strain DSM 3773 available from the
Deutsche Sammlung von Mikroorganismen und Zellkulturen GmbH.
Chromosomal DNA isolated from this strain has been found to
hybridize with the 4.5 kb genomic fragment containing the
α-amylase gene from P. furiosus (cf. Examples 3 and 4), and thus
constitutes an example of a DNA sequence hybridizing with a DNA
sequence having the properties i)-iii) above. Furthermore, it
is contemplated that analogous DNA sequences of the DNA
construct of the invention are derivable from mutants and
derivatives of the deposited P. furiosus strain DSM 3638 and P.
woesei strain DSM 3773 having retained their α-amylase
producing capability.

In the DNA construct of the invention comprising a DNA sequence
encoding an amylolytic enzyme, the homology with the amino acid
sequence shown in SEQ ID No. 9, i.e. property c), is intended
to indicate the degree of identity between the polypeptide
encoded by the DNA sequence and the amino acid sequence shown in
SEQ ID No. 9 indicating a derivation of the first sequence from
the second. In particular, a polypeptide is considered to be
homologous to the amino acid sequence shown in SEQ ID No. 9 if
a comparison of the respective amino acid sequences reveals an
identity of greater than about 70%, such as greater than about
75%, 80%, 85%, 90% or even 95%. Sequence comparisons can be
performed via known algorithms, such as the one described by
Lipman and Pearson (1985).
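
For illustration, the degree of identity between two sequences
that have already been aligned may be computed as sketched
below. The sketch assumes that the alignment itself has been
produced beforehand by a suitable algorithm, that gaps are
written as "-", and that identity is expressed relative to the
number of alignment columns; other denominators are in use, and
the function names and example sequences are hypothetical.

```python
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percentage of alignment columns holding identical residues."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    # Ignore columns in which both sequences carry a gap.
    columns = [
        (a, b) for a, b in zip(aligned_a, aligned_b)
        if not (a == gap and b == gap)
    ]
    if not columns:
        raise ValueError("no aligned positions to compare")
    identical = sum(a == b and a != gap for a, b in columns)
    return 100.0 * identical / len(columns)


def is_homologous(aligned_a, aligned_b, threshold=70.0):
    """True if the identity reaches the threshold (70% by default)."""
    return percent_identity(aligned_a, aligned_b) >= threshold


if __name__ == "__main__":
    # First sequence: partial sequence (a); second: made-up variant with
    # two substitutions, for illustration only.
    print(percent_identity("AKYLELEEGG", "AKYLDLEEGA"))  # 80.0
    print(is_homologous("AKYLELEEGG", "AKYLDLEEGA"))     # True
```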

A preferred example of a DNA construct of the invention
encoding an amylolytic enzyme is a DNA construct encoding a
Pyrococcus α-amylase or a variant thereof as described above.

The DNA sequence of the DNA constructs of the invention may be
prepared by well-known methods. Thus, the DNA sequence may, for
instance, be isolated by establishing a cDNA or genomic library
from an organism expected to harbour the sequence, e.g. a cell
as described above, and screening for positive clones by
conventional procedures. Examples of such procedures are
hybridization to oligonucleotide probes as described above in
accordance with standard techniques (cf. Sambrook et al.,
1989), and/or selection for clones expressing amylolytic
activity, such as α-amylase activity, and/or selection for
clones producing a protein which is reactive with an antibody
raised against an amylolytic enzyme such as a Pyrococcus
α-amylase encoded by the DNA construct of the invention.

A preferred method of isolating a DNA construct of the
invention from a cDNA or genomic library is by use of polymerase
chain reaction (PCR) using degenerate oligonucleotide probes
prepared on the basis of any of the DNA sequences shown in SEQ
ID Nos. 2-6 or of an amino acid sequence encoded by a DNA
construct of the invention, e.g. an amino acid sequence as
shown in SEQ ID No. 8 or 9 or any of the partial amino acid
sequences (a)-(R) disclosed above. For instance, the PCR may be
carried out using the techniques described in US Patent No.
4,683,202 or by R.K. Saiki et al. (1988).
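
The preparation of a degenerate oligonucleotide on the basis of
a partial amino acid sequence may be sketched as a
back-translation into IUPAC ambiguity codes. The sketch assumes
the standard genetic code; the function name is hypothetical.
It should be noted that, for residues encoded by six codons
(Leu, Ser, Arg), the position-wise consensus used here covers
more sequences than actually encode the residue, so the reported
degeneracy is that of the oligonucleotide pool, not the number
of coding sequences.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# IUPAC nucleotide ambiguity codes, keyed by the sorted set of bases.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "AG": "R", "CT": "Y", "CG": "S", "AT": "W", "GT": "K", "AC": "M",
    "CGT": "B", "AGT": "D", "ACT": "H", "ACG": "V", "ACGT": "N",
}


def degenerate_probe(peptide):
    """Back-translate a peptide; return (IUPAC probe, pool degeneracy)."""
    probe = []
    degeneracy = 1
    for aa in peptide.upper():
        codons = [c for c, a in CODON_TABLE.items() if a == aa]
        if not codons:
            raise ValueError(f"unknown amino acid: {aa!r}")
        for pos in range(3):
            # Bases observed at this codon position across all synonyms.
            bases = "".join(sorted({c[pos] for c in codons}))
            probe.append(IUPAC[bases])
            degeneracy *= len(bases)
    return "".join(probe), degeneracy


if __name__ == "__main__":
    # Partial amino acid sequence (b), SEQ ID NO 11.
    print(degenerate_probe("VIMQAFYWDV"))
    # ('GTNATHATGCARGCNTTYTAYTGGGAYGTN', 3072)
```

In practice a shorter stretch rich in Met and Trp residues would
presumably be preferred, since it keeps the degeneracy of the
pool low.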

Alternatively, the DNA sequence of the DNA construct of the
invention may be prepared synthetically by established standard
methods, e.g. the phosphoramidite method described by Beaucage
and Caruthers (1981), or the method described by Matthes et al.
(1984). According to the phosphoramidite method,
oligonucleotides are synthesized, e.g. in an automatic DNA
synthesizer, purified, annealed, ligated and cloned in
appropriate vectors.

Finally, the DNA construct may be of mixed genomic and
synthetic, mixed synthetic and cDNA or mixed genomic and cDNA
origin prepared by ligating fragments of synthetic, genomic or
cDNA origin (as appropriate), the fragments corresponding to
various parts of the entire recombinant DNA molecule, in
accordance with standard techniques.

As stated above, the DNA constructs of the invention may also
comprise a genetically modified DNA sequence. Such a sequence
may be prepared on the basis of a genomic or cDNA sequence
encoding an amylolytic activity, such as α-amylase activity,
suitably modified at a site corresponding to the site(s) of the
polypeptide at which it is desired to introduce amino acid
substitutions, e.g. by site-directed mutagenesis using synthetic
oligonucleotides encoding the desired amino acid sequence for
homologous recombination in accordance with well-known
procedures, or by use of random mutagenesis, e.g. through
radiation, chemical treatment or PCR using random
oligonucleotide primers.

Examples of suitable modifications of the DNA sequence are
nucleotide substitutions which do not give rise to another
amino acid sequence of the polypeptide, but which may
correspond to the codon usage of the host organism into which
the recombinant DNA molecule is introduced, or nucleotide
substitutions which do give rise to a different amino acid
sequence and therefore, possibly, a different polypeptide
structure without, however, impairing the essential properties
of the polypeptide related to amylolytic activity and/or
immunological cross-reactivity as described above. Other
examples of possible modifications are insertion of one or more
nucleotides into the sequence, addition of one or more
nucleotides at either end of the sequence and deletion of one or
more nucleotides at either end of or within the sequence.
The vector carrying a DNA construct of the invention, which is
preferably a recombinant expression vector, may be any vector
which may conveniently be subjected to recombinant DNA
procedures, and the choice of expression vector will often
depend on the host cell into which it is to be introduced. Thus,
the vector may be an autonomously replicating vector, i.e. a
vector which exists as an extrachromosomal entity, the
replication of which is independent of chromosomal replication,
e.g. a plasmid or a bacteriophage. Alternatively, the vector may
be one which, when introduced into a host cell, is integrated
into the host cell genome and replicated together with the
chromosome(s) into which it has been integrated.

In the DNA construct or the vector, the DNA sequence should be
operably connected to a suitable promoter sequence. The
promoter may be any DNA sequence which shows transcriptional
activity in the host cell of choice and may be derived from
genes encoding proteins either homologous or heterologous to the
host cell. Examples of suitable promoters for directing the
transcription of the DNA construct of the invention are the
promoter of the lac operon of E. coli, the Streptomyces
coelicolor agarase gene dagA promoters, the promoters of the
Bacillus licheniformis α-amylase gene (amyL), the promoters of
the Bacillus stearothermophilus maltogenic amylase gene (amyM),
the promoters of the Bacillus amyloliquefaciens α-amylase gene
(amyQ), etc.

The DNA construct and/or expression vector of the invention may
also comprise a suitable terminator operably connected to the
DNA sequence encoding the amylolytic enzyme, such as the
Pyrococcus α-amylase or variant thereof of the invention. An
example of a suitable terminator is that of the Bacillus
licheniformis α-amylase gene.

The DNA construct and/or vector may further comprise a DNA
sequence enabling the vector to replicate in the host cell in
question. Examples of such sequences are the origins of
replication of plasmids pUC19, pACYC177, pUB110, pE194, pAMB1
and pIJ702.
The DNA construct and/or vector may also comprise a selectable
marker, e.g. a gene the product of which complements a defect
in the host cell, such as the dal genes from B. subtilis or
B. licheniformis, or one which confers antibiotic resistance
such as ampicillin, kanamycin, chloramphenicol or tetracycline
resistance.

While intracellular expression may be advantageous in some
respects, e.g. when using certain bacteria as host cells, it is
generally preferred that the expression is extracellular. In
order to obtain extracellular expression, the DNA construct
and/or expression vector should normally further comprise a DNA
sequence encoding a preregion, i.e. a signal peptide,
permitting secretion of the expressed amylolytic enzyme, such as
a Pyrococcus α-amylase or variant thereof, into the culture
medium.

The procedures used to construct the DNA construct of the
invention, which comprise ligating a DNA sequence encoding an
amylolytic enzyme, e.g. the Pyrococcus α-amylase or variant
thereof, the promoter, terminator and other elements,
respectively, and inserting them into suitable vectors
containing the information necessary for replication, are well
known to persons skilled in the art (cf., for instance,
Sambrook et al. (1989)).

The cell of the invention, comprising either a DNA construct or
an expression vector of the invention as defined above, is
advantageously used as a host cell in the recombinant
production of a polypeptide of the invention. The cell may be
transformed with the DNA construct of the invention,
conveniently by integrating the DNA construct in the host
chromosome, although the DNA construct may also exist as an
extrachromosomal entity. However, the integration is generally
considered to be an advantage as the DNA sequence is more likely
to be stably maintained in the cell. Integration of the DNA
constructs into the host chromosome may be performed according
to conventional methods, e.g. by homologous recombination.
Alternatively, the cell may be transformed with an expression
vector as described below in connection with the different types
of host cells.

The cell of the invention may be a cell of a higher organism
such as a mammal or an insect, but is preferably a microbial
cell, e.g. a bacterial or a fungal (including yeast) cell.

Examples of suitable bacteria are gram-positive bacteria such as
Bacillus subtilis, Bacillus licheniformis, Bacillus lentus,
Bacillus brevis, Bacillus stearothermophilus, Bacillus
alkalophilus, Bacillus amyloliquefaciens, Bacillus coagulans,
Bacillus circulans, Bacillus lautus, Bacillus thuringiensis or
Streptomyces lividans or Streptomyces murinus, or gram-negative
bacteria such as E. coli. The transformation of the bacteria may
for instance be effected by protoplast transformation or by
using competent cells in a manner known per se.

The yeast organism may favourably be selected from a species of
Saccharomyces or Schizosaccharomyces, e.g. Saccharomyces
cerevisiae. The filamentous fungus may advantageously belong to
a species of Aspergillus, e.g. Aspergillus oryzae or
Aspergillus niger. Fungal cells may be transformed by a process
involving protoplast formation and transformation of the
protoplasts followed by regeneration of the cell wall in a manner
known per se.

The recombinant Bacillus cell of the invention, which is
different from a cell of Bacillus licheniformis, and which
harbours a DNA construct comprising a DNA sequence encoding a
Pyrococcus α-amylase or a variant thereof, is preferably
selected from a strain of Bacillus subtilis, Bacillus lentus,
Bacillus brevis, Bacillus stearothermophilus, Bacillus
alkalophilus, Bacillus amyloliquefaciens, Bacillus coagulans,
Bacillus circulans, Bacillus lautus or Bacillus thuringiensis.

The DNA construct harboured in the recombinant Bacillus cell of
the invention is preferably a DNA construct as defined above,
in which the DNA sequence encoding a Pyrococcus α-amylase or a
variant thereof is derivable from a strain of Pyrococcus woesei
or from a strain of Pyrococcus furiosus, in particular from the
Pyrococcus woesei strain DSM 3773 or the Pyrococcus furiosus
strain DSM 3638 or mutants or derivatives of said strains
having retained their α-amylase producing capability.

In the process of the invention for producing an amylolytic
enzyme, such as a Pyrococcus α-amylase or a variant thereof, a
cell of the invention as defined above is cultured in a suitable
culture medium under conditions permitting expression of
the amylolytic enzyme, and the resulting enzyme is recovered
from the culture.

The medium used to cultivate the cells may be any conventional
medium suitable for growing the host cell in question. Suitable
media are available from commercial suppliers or may be
prepared according to published recipes (e.g. in catalogues of
the American Type Culture Collection).

The amylolytic enzyme, e.g. the Pyrococcus α-amylase or variant
thereof, may be recovered from the medium by conventional
procedures including separating the cells from the medium by
centrifugation or filtration, if necessary after disruption of
the cells, precipitating the proteinaceous components of the
supernatant or filtrate by means of a salt, e.g. ammonium
sulphate, followed by purification by a variety of chromatographic
procedures, e.g. ion exchange chromatography, affinity
chromatography, or the like.

The Pyrococcus α-amylase or variant thereof encoded by a DNA
construct and/or produced by the process of the invention may
be used in any process in which α-amylase activity is required,
for instance in starch conversion or liquefaction processes
used in the wine or fruit industry, for modifying starch to be
used in the pulp and paper industry or for textile desizing
in the preparation of textiles. Also, the polypeptide may
be used in baking in order to improve the properties of dough
and/or baked products and for detergent purposes, e.g. as a
constituent of a detergent additive or a detergent composition.
Other uses involve degradation of biological waste, biomass
conversion or degradation, or the preparation of energy from
biological material.

Due to its high thermostability the Pyrococcus α-amylase or
variant thereof of the invention is contemplated to be of
particular use for processes in which a high thermostability is
advantageous. An example of such a process is a starch
liquefaction process, e.g. performed as disclosed in WO 90/11352,
the contents of which are hereby incorporated by reference.

More specifically, the starch liquefaction process of the
invention may be used for enzymatic conversion of starch into
sugars, e.g. for the production of fuel alcohol or High
Fructose Syrup (HFS). Suitable liquefaction conditions are up
to 120 minutes at 100-140°C, more preferred 1-60 minutes at
100-120°C, most preferred 1-30 minutes at 105-110°C, optionally
followed by reduction of the temperature to be held in the
range of 90-100°C for about 30-120 minutes. It is preferred not
to add calcium salts to the aqueous starch slurry. The pH
should be held within 3.5-6.0, more preferred 4.0-5.5, most
preferred 4.2-4.8. A continuous process is preferred, and the
heating is most preferably by jet-cooking. The dosage level of
the α-amylase of the invention or a variant thereof is typically
in the range of 5-500 NU, preferably 10-50 NU, per gram
starch DS (dry substance). The starch concentration will
usually be in the range 15-45% DS (w/w% dry substance), most
often 25-35% DS.
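The dosage statement above can be turned into a small worked
calculation. The following is a minimal sketch, not part of the
disclosed process: the function names and the example batch
(1000 g of slurry at 30% DS dosed at 50 NU per gram DS) are
illustrative assumptions; only the 5-500 NU per gram DS range is
taken from the text.

```python
def enzyme_dose_nu(slurry_mass_g: float, ds_percent: float,
                   nu_per_g_ds: float) -> float:
    """Total activity (NU) needed for a slurry of the given mass.

    ds_percent is the dry-substance content in w/w %, and
    nu_per_g_ds is the chosen dosage in NU per gram starch DS.
    """
    if not 0 < ds_percent <= 100:
        raise ValueError("ds_percent must be in (0, 100]")
    if nu_per_g_ds < 0:
        raise ValueError("nu_per_g_ds must be non-negative")
    starch_ds_g = slurry_mass_g * ds_percent / 100.0
    return starch_ds_g * nu_per_g_ds


def in_typical_range(nu_per_g_ds: float) -> bool:
    """True if the dosage lies in the 5-500 NU/g DS range given in the text."""
    return 5 <= nu_per_g_ds <= 500


if __name__ == "__main__":
    # Hypothetical batch: 1000 g slurry, 30% DS, 50 NU per g DS.
    print(enzyme_dose_nu(1000, 30, 50), in_typical_range(50))
```

Dividing the returned total by the activity of an enzyme
preparation (NU per ml) would give the volume to add.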

The activity standard NU (which is an abbreviation of NOVO
α-amylase unit) is the amount of enzyme which hydrolyses 5.26 mg
of dissolved starch per hour at 37°C, pH 5.6 and 0.0043 M of
Ca++ over a 7-20 minute reaction time. A folder AF9/6 describing
the analytical method is available on request to NOVO NORDISK
A/S, DENMARK. The activity of the Pyrococcus α-amylase is
determined at 60°C and related to a Termamyl standard, assayed
under the same conditions.

The liquefied starch may hereafter be subjected to enzymatic
saccharification in the presence of a glucoamylase,
substantially without an intermediate pH adjustment. In this case the
starch is liquefied with an α-amylase or variant of the
invention at pH 3.5-6.0, more preferred at pH 4.0-4.5, most
preferred pH 4.2-4.8. The liquefied starch may also be subjected
to subsequent enzymatic saccharification in the presence of a
glucoamylase in combination with a debranching enzyme such as
pullulanase (see EP 63 909 for details) and/or an acid stable
α-amylase from for example A. niger (see EP 140 410 for
details).

The liquefaction process of the invention may also be used for
producing ethanol. In this case the starch is liquefied with
α-amylase at a pH of 3.5-6.0, more preferred 4.0-5.5, followed by
saccharification with glucoamylase and simultaneous or
subsequent fermentation with yeast. Thereafter the alcohol may be
recovered by methods known in the art. Preferably the whole
process is carried out at a pH of about 4.5 without any
intermediate pH adjustment, and simultaneous saccharification and
fermentation is performed at 30-35°C for up to 96 hours. The
liquefaction can be conducted either at low DS levels (15-20%) or
high DS levels (20-40%). In the high DS processes, the DS level
must be reduced to about 20% prior to fermentation to obtain
about 10% alcohol by volume, which is about the maximum that most
yeast can tolerate.
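The reduction of the DS level before fermentation is a simple
mass balance. The sketch below assumes that the liquefied mash is
diluted with water only and that DS is expressed in w/w %, so that
the dry-substance mass is unchanged by the dilution; the function
name and the example figures are hypothetical.

```python
def water_to_add_g(mash_mass_g: float, ds_start_percent: float,
                   ds_target_percent: float) -> float:
    """Mass of water (g) that dilutes a mash from one DS level to another.

    Dry substance is conserved:
        mash_mass * ds_start = (mash_mass + water) * ds_target
    """
    if not 0 < ds_target_percent <= ds_start_percent <= 100:
        raise ValueError("need 0 < target DS <= start DS <= 100")
    return mash_mass_g * (ds_start_percent / ds_target_percent - 1.0)


if __name__ == "__main__":
    # Hypothetical example: 1000 g of mash at 35% DS diluted to 20% DS.
    print(water_to_add_g(1000, 35, 20))
```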

The raw material for alcohol production may include refined
starch such as wet milled corn starch; raw, unprocessed
materials such as corn, wheat, rice, sorghum, cassava and potato
(whose starch content ranges from 15 to 80%); and other starch
containing materials such as waste and by-products from
industry.

Furthermore, the starch liquefaction may be performed by a
process comprising



WO 94/19454 



20 



PCT/DK94/00071 



a) inserting a DNA construct of the invention encoding the
Pyrococcus α-amylase or a variant thereof, optionally present
in a suitable expression vector, into a suitable host organism,

b) culturing the host organism in a suitable culture medium
under conditions permitting expression of the Pyrococcus
α-amylase or variant thereof, and recovering the resulting
Pyrococcus α-amylase or variant thereof from the culture, and

c) subjecting an aqueous starch slurry to enzymatic
liquefaction in the presence of the Pyrococcus α-amylase or variant
thereof recovered in step b), each of the steps being performed
as described above.

BRIEF DESCRIPTION OF THE DRAWINGS 

The present invention is described in the following with
reference to the appended drawings, in which

Fig. 1 illustrates the plasmid pSJ1678, the nucleotide
sequence of which is shown in SEQ ID No. 7,

Fig. 2 illustrates the plasmid pSJ2467,

Fig. 3 illustrates the plasmid pSJ2481,

Fig. 4 illustrates the plasmid pSJ2482,

Figs. 5 and 6 illustrate the result of the Southern analysis
described in Example 5,

Fig. 7 illustrates the plasmids pSJ2487 and pSJ2488,

Fig. 8 illustrates the plasmids pSJ2489 and pSJ2490,

Fig. 9 illustrates the α-amylase activity of a DNA construct of
the invention as further described in Example 6, and

Figs. 10 and 11 are chromatograms illustrating the distribution
of oligosaccharides obtained from starch having been exposed to
an α-amylase encoded by a DNA construct of the invention for 24
and 48 hours, respectively.

The following abbreviations are used on the plasmid drawings:

rep          the gene for the pUB110 Rep protein (Gryczan et
             al., 1978)
cat          the gene for chloramphenicol acetyl transferase
             from pC194 (Horinouchi and Weisblum, 1982)
p15A Ori     the replication functions from the E. coli cryptic
             plasmid p15A (Chang and Cohen, 1978)
PamyM        the promoter region from the maltogenic amylase
             gene of Bacillus stearothermophilus (Diderichsen
             and Christiansen, 1988)
kanD         the kanamycin resistance gene from pUB110, deleted
             for its own promoter
pUB110 Ori   the double-stranded origin of replication for
             pUB110
amyA.pfu     the alpha-amylase gene from Pyrococcus furiosus
bla          the beta-lactamase gene from the pUC plasmids
             (Yanisch-Perron et al., 1985)
Plac         the beta-galactosidase promoter from the pUC plasmids
'amyA.pfu    the amyA.pfu gene truncated in the 5' end
amyA.pfu'    the amyA.pfu gene truncated in the 3' end

The present invention is further illustrated by the following
examples which are not in any way intended to limit the scope
of the invention as defined herein.



MATERIALS AND METHODS

Bacteria

Pyrococcus furiosus DSM 3638 and Pyrococcus woesei DSM 3773 are
available from the Deutsche Sammlung von Mikroorganismen und
Zellkulturen GmbH, Braunschweig, Germany.
The strains were grown in medium according to the description
supplied by the DSM along with the strains.

E. coli SJ2 has been described by Diderichsen et al., 1990, and
cells were prepared for and transformed by electroporation
using a Gene Pulser™ electroporator from BIO-RAD as described
by the supplier.

B. subtilis DN1885 has been described by Diderichsen et al.,
1990, and competent cells were prepared and transformed as
described by Yasbin et al., 1975.

Plasmids 

pSJ1678 (Fig. 1) was used as cloning vector in the construction
of the gene library. The entire sequence of pSJ1678 is given in
the appended SEQ ID No. 7.

pUC19 (Yanisch-Perron et al., 1985) was used for subclonings.

General methods

The experimental techniques used to construct the plasmids were
standard techniques within the field of recombinant DNA
technology, cf. Sambrook et al., 1989.

Restriction endonucleases were purchased from New England
Biolabs and Boehringer Mannheim and used as recommended by the
manufacturers. T4 DNA ligase was purchased from New England
Biolabs and used as recommended by the manufacturer.

Preparation of plasmid DNA from all strains was conducted by
the method described by Kieser, 1984.

Amylase activity may be measured as absorption/ml at 620 nm
using Phadebas tablets (Phadebas® Amylase Test; Pharmacia
Diagnostics, SW). The assay is carried out for 15 min at 60°C,
pH 7.3 in the presence of 0.15 nM calcium following the
procedure described in the Novo Nordisk AF publication
AF-207/-GB, available upon request. The enzyme activity is compared to
that of an enzyme standard and the result is expressed in the
same unit as that of the enzyme standard (e.g. NU as defined
herein).
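Comparing a sample with an enzyme standard can be sketched as
reading the sample absorbance off a standard curve. The following
is an illustrative sketch only: the piecewise-linear interpolation,
the function name and the example curve are assumptions and do not
reproduce the procedure of publication AF-207/-GB.

```python
from bisect import bisect_left


def activity_from_standard(a620: float, standard_curve,
                           dilution: float = 1.0) -> float:
    """Estimate activity (units of the standard, e.g. NU/ml).

    standard_curve is a list of (absorbance at 620 nm, activity)
    pairs measured for dilutions of the enzyme standard under the
    same assay conditions. Values are interpolated linearly between
    neighbouring points; extrapolation is refused.
    """
    points = sorted(standard_curve)
    xs = [x for x, _ in points]
    if not xs[0] <= a620 <= xs[-1]:
        raise ValueError("absorbance outside the standard curve")
    i = bisect_left(xs, a620)
    if xs[i] == a620:
        return points[i][1] * dilution
    (x0, y0), (x1, y1) = points[i - 1], points[i]
    return (y0 + (y1 - y0) * (a620 - x0) / (x1 - x0)) * dilution


if __name__ == "__main__":
    # Hypothetical two-point standard curve.
    curve = [(0.1, 0.5), (0.5, 2.5)]
    print(activity_from_standard(0.3, curve))
```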

Media

Liquid cultures for plasmid preparation were grown in TY media
supplemented with appropriate antibiotics

Trypticase       20 g/l
Yeast Extract     5 g/l
FeCl2·4H2O        6 mg/l
MnCl2·4H2O        1 mg/l
MgSO4·7H2O       15 mg/l
pH 7.3



Liquid cultures for enzyme production and characterization were
grown in Terrific Broth supplemented with relevant antibiotics

Bacto-tryptone        12 g
Bacto Yeast-extract   24 g
Glycerol               4 ml
Water                 to 900 ml

Following autoclaving, 100 ml of the following, separately
autoclaved solution is added

KH2PO4   0.17 M
K2HPO4   0.72 M
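The molar concentrations of the phosphate solution can be
converted into weighed amounts. This is a minimal sketch; the
molar masses (136.09 g/mol for KH2PO4 and 174.18 g/mol for K2HPO4,
both anhydrous) are standard values that are not stated in the
text, and the function name is illustrative.

```python
# Molar masses in g/mol of the anhydrous salts (not given in the text).
MOLAR_MASS = {"KH2PO4": 136.09, "K2HPO4": 174.18}


def grams_needed(salt: str, molarity: float, volume_ml: float) -> float:
    """Mass of salt (g) to dissolve for the given molarity and volume."""
    return molarity * (volume_ml / 1000.0) * MOLAR_MASS[salt]


if __name__ == "__main__":
    # 100 ml of the separately autoclaved phosphate solution.
    for salt, molarity in (("KH2PO4", 0.17), ("K2HPO4", 0.72)):
        print(salt, round(grams_needed(salt, molarity, 100), 2), "g")
```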

Solid medium was LB agar

Bacto-tryptone        10 g/l
Bacto Yeast-extract    5 g/l
NaCl                  10 g/l
Bacto agar            15 g/l
Adjusted to pH 7.5 with NaOH



Medium for visualization of amylase activity was LB agar
containing, per 500 ml agar, 10 ml of a dyed amylopectin
solution prepared as follows

12.5 g amylopectin (Serva 2000-4000 kD) is dissolved by boiling
in 250 ml water and cooled to room temperature, 30 ml 4 M NaOH and
2.5 g Cibacron Rot B are added, and the solution is incubated
overnight. The pH is adjusted to 7 with 4 M HCl. 500 ml 96 %
ethanol is added with stirring to precipitate the amylopectin
as a red, viscous precipitate. The supernatant is discarded,
the amylopectin is dissolved in 200 ml water by slight heating,
and the precipitation with ethanol is repeated. The amylopectin is
again dissolved in 200 ml water and autoclaved, and is then ready
for use.

EXAMPLES

EXAMPLE 1

Cloning of the Pyrococcus furiosus α-amylase gene

Genomic DNA from Pyrococcus furiosus DSM 3638 was isolated by
the method of Pitcher et al., 1989. Approximately 100 µg DNA
was partially digested with Sau3A, size fractionated on a
sucrose gradient, and fragments between 3 and 7 kb were pooled.

The cloning vector pSJ1678 was digested with BamHI, and a 3.8
kb fragment was purified from an agarose gel. Approximately
0.75 µg vector fragment was ligated to approx. 4 µg
size-fractionated P. furiosus chromosomal DNA, and used to transform
E. coli SJ2 by electroporation.
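The amounts given for the ligation can be expressed as an
approximate insert-to-vector molar ratio. The sketch below is
illustrative only: it assumes a mean insert size of 5 kb (the
midpoint of the 3-7 kb pool, which is not stated in the text) and
uses the fact that, for double-stranded DNA, the number of
molecules is proportional to mass divided by length.

```python
def molar_ratio(insert_ug: float, insert_bp: float,
                vector_ug: float, vector_bp: float) -> float:
    """Insert:vector molar ratio from DNA masses and fragment lengths.

    For double-stranded DNA, moles are proportional to mass / length,
    so the per-base-pair molar mass cancels out of the ratio.
    """
    if min(insert_ug, insert_bp, vector_ug, vector_bp) <= 0:
        raise ValueError("all arguments must be positive")
    return (insert_ug / insert_bp) / (vector_ug / vector_bp)


if __name__ == "__main__":
    # 4 ug insert pool (assumed mean 5 kb) and 0.75 ug of 3.8 kb vector.
    print(round(molar_ratio(4.0, 5000, 0.75, 3800), 2))
```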

The gene bank was plated on LB plates containing dyed
amylopectin and supplemented with 10 µg/ml chloramphenicol. Following
overnight incubation at 37°C, each plate was replica plated
onto two new plates which were then incubated overnight at
37°C. One of these was subsequently incubated at 60°C
overnight.

Clear halos indicating degradation of the amylopectin were
observed around 5 colonies (among a total of 10000) on the 60°C
plates, whereas no halos were observed around the colonies on
the plates that were kept at 37°C. These 5 strains, taken from
the 37°C plates, were kept as SJ2463-SJ2467.

Restriction digests revealed that the P. furiosus DNA inserts on
the four clones SJ2463, SJ2464, SJ2465 and SJ2467 shared a
common DNA region without the inserts being totally identical,
whereas the DNA contained on pSJ2466 appeared unrelated to
these four clones. pSJ2467 (Fig. 2) contained an insert of
approximately 4.5 kb and was used for further analysis.
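Whether a library of 10000 clones is likely to contain a given
gene can be estimated with the Clarke-Carbon relation
P = 1 - (1 - f)^N, where f is the ratio of insert size to genome
size and N is the number of clones. This is a rough sketch under
assumptions that are not in the text: randomly distributed
fragments, a mean insert size of 5 kb, and a P. furiosus genome
of about 1.9 Mb.

```python
import math


def prob_sequence_in_library(n_clones: int, mean_insert_bp: float,
                             genome_bp: float) -> float:
    """Probability that a given sequence is present in the library."""
    f = mean_insert_bp / genome_bp
    return 1.0 - (1.0 - f) ** n_clones


def clones_needed(prob: float, mean_insert_bp: float,
                  genome_bp: float) -> int:
    """Smallest number of clones giving at least the requested probability."""
    f = mean_insert_bp / genome_bp
    return math.ceil(math.log(1.0 - prob) / math.log(1.0 - f))


if __name__ == "__main__":
    GENOME_BP = 1_900_000  # assumed genome size, not from the text
    print(prob_sequence_in_library(10_000, 5_000, GENOME_BP))
    print(clones_needed(0.99, 5_000, GENOME_BP))
```

Under these assumptions the library is far larger than needed for
99% coverage, which is consistent with several independent
overlapping clones being recovered.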

EXAMPLE 2

Starch degradation using amylase produced from SJ2467

E. coli SJ2467 (the above-described E. coli SJ2 harbouring
pSJ2467) was grown in Terrific Broth medium supplemented with
6 µg/ml chloramphenicol for 3 days at 37°C. The E. coli cells
in the culture broth were lysed by sonication.

One sample was directly sterile filtered; another sample was
sterile filtered after a brief heat-treatment.

2 g waxy corn starch were slurried with 10 ml 0.1 M acetate
buffer, with pH values of 4.3, 5.0 and 5.5, containing 40 ppm
Ca++, in 180 x 18 mm glass culture tubes. 2 ml heat-treated
(105°C, 5 min.) E. coli sonicated extract, containing
approximately 2 NU/ml Pyrococcus furiosus amylase, were added and the
pH controlled at ambient temperature.

The tightly sealed glass tubes were transferred to a
thermostated oil-bath at 105°C and vigorously agitated. Within
minutes of gelatinization, the starch was liquefied. Samples
were taken periodically and tested with iodine (1 drop 0.1 M
iodine in 5 ml deionized water). The following results were
obtained.






(Results table only partly recoverable from the source. Column
headings: Sample; Iod. colour. Surviving entries, in original
order: brown; 0.5; purple; 6; red; 24; 5.4; brown.)



This test demonstrates that P. furiosus amylase, expressed in
E. coli, is active and stable at high temperature and low pH,
in the presence of starch.

Furthermore, the test demonstrates that it is possible to
obtain expression of the extremely thermophilic P. furiosus
amylase in active form in E. coli. It may be considered
surprising that the Pyrococcus α-amylase, which in its native
environment is synthesized, secreted and folded into an active
three-dimensional structure at 100°C, may also be produced in
active form at 37°C.

In a further experiment 2 g waxy corn starch were slurried with
2 ml 1 M acetate buffer, pH 5.5, in a 180 x 18 mm glass culture
tube. 10 ml E. coli sonicated extract, containing approximately
1 NU/ml P. furiosus amylase, were added and the pH adjusted to
5.5 at ambient temperature.

The tightly sealed glass tube was transferred to a thermostated
oil-bath at 105°C and vigorously agitated. Within seconds of
gelatinization, the starch was liquefied. Samples were taken
periodically and tested with iodine (1 drop 0.1 M iodine in 5
ml deionised water). Samples were also taken for HPLC analysis.
The following results were obtained.



(Results table only partly recoverable from the source;
surviving entries: 24; yellow.)

The distribution of oligosaccharides seen in the chromatograms
shown in Figs. 10 and 11 is typical of endo-amylase attack. The
major oligosaccharides formed on prolonged hydrolysis are DP5,
DP6 and DP7 (where DP = degree of polymerization).

EXAMPLE 3

Subcloning of the P. furiosus α-amylase gene

pSJ2467 was digested with ClaI, and the 4.5 kb fragment
containing the α-amylase gene was ligated to AccI digested pUC19 DNA
and the ligation mixture transformed into E. coli SJ2.
Transformants were obtained containing the insert in each of the two
possible orientations with respect to the cloning vector.
These were SJ2481 containing pSJ2481 (Fig. 3), and SJ2482
containing pSJ2482 (Fig. 4).

Both clones produce α-amylase as visualized by the appearance
of clear halos on dyed amylopectin plates after incubation at
60°C. The amylase-producing transformants appear somewhat sick
as compared to transformants containing the pUC19 vector
plasmid only. They form smaller, more translucent colonies.

Further subclonings were performed from pSJ2481. pSJ2487 (Fig.
7) was constructed by deletion of the 1 kb XbaI fragment from
pSJ2481 and transformation of the religated plasmid into E.
coli SJ2. The resulting transformants were not able to produce
halos on LB plates containing dyed amylopectin, indicating that
this deletion had removed a DNA region of importance for
expression of an active amylase protein.

The 1 kb XbaI fragment from pSJ2481 was inserted into XbaI
digested pUC19, to give pSJ2489 and pSJ2490 (identical, Fig.
8).

EXAMPLE 4

DNA sequences

The ends of the insert on several subclones in pUC19 were
sequenced directly on the double-stranded plasmids using
Sequenase™ and oligonucleotide primers hybridizing just outside
the pUC19 multilinker region.

The resulting sequences are given in SEQ ID Nos. 2, 3, 4, 5 and
6.

SEQ ID No. 2 is derived from pSJ2490, and read from the
end of the insert next to the EcoRI site in the pUC polylinker.

SEQ ID No. 3 is derived from pSJ2489, and read from the end
of the insert next to the HindIII site in the pUC polylinker.

SEQ ID No. 4 is derived from pSJ2487, and read from the end
of the insert next to the EcoRI site in the pUC polylinker.

SEQ ID No. 5 is derived from pSJ2482, and read from the end
of the insert next to the EcoRI site in the pUC polylinker.

SEQ ID No. 6 is derived from pSJ2482, and read from the end
of the insert next to the HindIII site in the pUC polylinker.

Based on the DNA sequences given herein, it will be possible to
synthesize oligonucleotide primers that can be used to amplify
from chromosomal DNA of Pyrococcus furiosus in a PCR reaction
a DNA fragment identical to the entire Pyrococcus furiosus DNA
insert contained on pSJ2467 and pSJ2481/2482, thereby
reconstructing these plasmids from available material.
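Deriving a primer pair from the two ends of a known sequence can
be sketched as follows. This is an illustrative sketch, not the
procedure of the specification: the function names, the default
primer length of 20 nucleotides, the toy template and the use of
the simple Wallace rule (2°C per A/T, 4°C per G/C) for a rough
melting temperature are all assumptions.

```python
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Reverse complement of an unambiguous DNA sequence."""
    seq = seq.upper()
    if set(seq) - set("ACGT"):
        raise ValueError("sequence contains characters other than A, C, G, T")
    return seq.translate(_COMPLEMENT)[::-1]


def wallace_tm(primer: str) -> int:
    """Rough melting temperature (°C) by the Wallace rule."""
    p = primer.upper()
    at = p.count("A") + p.count("T")
    gc = p.count("G") + p.count("C")
    return 2 * at + 4 * gc


def end_primers(template: str, length: int = 20):
    """Forward primer from the 5' end, reverse primer from the 3' end."""
    template = template.upper()
    if len(template) < 2 * length:
        raise ValueError("template too short for two primers")
    return template[:length], reverse_complement(template[-length:])


if __name__ == "__main__":
    # Toy template, not a sequence from the sequence listing.
    fwd, rev = end_primers("ATGCATGCAA", length=4)
    print(fwd, rev, wallace_tm(fwd))
```

In practice the two end sequences would be taken from the reads
obtained at opposite ends of the insert.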
EXAMPLE 5

Southern analysis

pSJ2481 was 32P-labelled by nick-translation using a commercial
kit obtained from Amersham, and used as probe in a Southern
analysis. Hybridization was overnight at 60°C in 10x Denhardt's
solution, 1% SDS, 10 mM EDTA and 5x SSC, followed by two 15 min.
washes in 2x SSC, 0.1% SDS at room temperature and one 15 min.
wash at 60°C. The resulting exposures are shown in Fig. 5 and
Fig. 6.

Fig. 5 reveals

1) that pSJ2463, pSJ2464, pSJ2465 and pSJ2467 contain a common
DNA region as previously stated (a HindIII fragment of approx.
0.5 kb is common to pSJ2463 (lane 1), pSJ2464 (lane 2), pSJ2465
(lane 3), pSJ2467 (lane 5) and to chromosomal P. furiosus DNA
(lane 7)).

2) that the insert on pSJ2481 is derived from the chromosome of
P. furiosus (lane 7).

Fig. 6 reveals

3) that a homologous DNA region exists in the chromosome of P.
woesei (the chromosomal P. woesei DNA being isolated by the
method according to Pitcher et al., 1989). Thus pSJ2481
hybridizes to exactly the same fragments in HindIII digested P.
woesei DNA (lane 2) as in HindIII digested P. furiosus DNA
(Fig. 5, lane 7). For clarity, a longer exposure of lane 2 is
added as lane 0 of Fig. 6 to reveal the 0.5 kb HindIII
fragment.

pSJ2481, or the sequence information given herein obtained from
pSJ2481, can therefore be used as a tool to identify and
thereby assist cloning of the α-amylase gene from the
chromosome of either P. furiosus or P. woesei.






EXAMPLE 6

Expression of the α-amylase gene in Bacillus subtilis

The plasmid pSJ1678 used for construction of the gene library
is a shuttle vector able to replicate in both E. coli and B.
subtilis.

To test for expression of the amylase activity in B. subtilis,
pSJ2467 was therefore transformed into competent cells of
DN1885, selecting for resistance to chloramphenicol (6 µg/ml) on
LB plates containing dyed amylopectin. Ten transformants were
picked onto two new plates with dyed amylopectin, along with
SJ1678, which is DN1885/pSJ1678, as a control. After incubation
overnight at 37°C one plate was transferred to 65°C, whereas
the other was kept at 37°C. Seven hours later, degradation of
the amylopectin around the 10 transformants with pSJ2467 was
apparent on the plate incubated at 65°C as formation of a clear
halo. No halo was formed around the control strain (Fig. 9).

The fact that expression of amylase activity from a Pyrococcus
α-amylase gene can be obtained in Bacillus subtilis without any
modification of the gene, e.g. in the form of modification or
replacement of the ribosome binding site to allow more
efficient initiation of translation, is surprising indeed.
Generally, the majority of cloned genes from non-gram-positive
organisms fail to express from their own expression signals in
B. subtilis (Mountain, A., 1989), and there have to our
knowledge been no prior reports of direct expression of a gene from
Pyrococcus in B. subtilis.

EXAMPLE 7

4.5 kb of the P. furiosus DNA insert cloned on pSJ2467 (Example
1) was sequenced on both strands, using Sequenase™ and a
combination of subclones and oligonucleotide primers based on
previously determined sequences.

The open reading frame corresponding to the α-amylase gene was
localized by subcloning (the ability of individual subclones to
produce α-amylase was assayed on plates containing dyed
amylopectin).

The α-amylase encoded by this open reading frame revealed
homology to α-amylases and other starch-degrading enzymes from
a variety of organisms including bacteria, insects and plants.
When aligned with a B. licheniformis α-amylase, a degree of
identity of about 36% could be observed when 18 gaps were
introduced at various sites in the sequence.
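A degree of identity and a gap count of this kind can be computed
from a pairwise alignment. The sketch below is illustrative and
rests on stated assumptions: the alignment is supplied as two
equal-length strings with "-" for gaps, identity is expressed
relative to the full alignment length (other conventions exist and
the text does not say which was used), and the example sequences
are toy data.

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Identical, non-gap columns as a percentage of alignment length."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    if not aligned_a:
        raise ValueError("empty alignment")
    matches = sum(1 for x, y in zip(aligned_a, aligned_b)
                  if x == y and x != gap)
    return 100.0 * matches / len(aligned_a)


def count_gaps(aligned: str, gap: str = "-") -> int:
    """Number of gap runs (each contiguous stretch counts once)."""
    runs = 0
    previous_was_gap = False
    for ch in aligned:
        is_gap = ch == gap
        if is_gap and not previous_was_gap:
            runs += 1
        previous_was_gap = is_gap
    return runs


if __name__ == "__main__":
    a, b = "MKT-AY", "MRTGAY"  # toy alignment
    print(round(percent_identity(a, b), 1), count_gaps(a) + count_gaps(b))
```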

The DNA sequence of the α-amylase coding region, including the
signal peptide coding region, is shown in SEQ ID No. 1.
On the basis of the DNA sequence shown in SEQ ID No. 1 the
amino acid sequence of the signal peptide (SEQ ID No. 8) and of
the mature α-amylase (SEQ ID No. 9) have been deduced.
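Deducing an amino acid sequence from a coding sequence amounts to
reading it codon by codon. The following is a minimal sketch using
the standard genetic code; the function names, the toy input and
the idea of splitting at a known signal peptide length are
illustrative assumptions, and the code does not predict where the
signal peptide ends.

```python
from itertools import product

# Standard genetic code, codons ordered with T, C, A, G at each position.
_BASES = "TCAG"
_AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
          "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(codon): aa
               for codon, aa in zip(product(_BASES, repeat=3), _AMINO)}


def translate(dna: str, to_stop: bool = True) -> str:
    """Translate a coding sequence in frame 1 (one-letter amino acid codes).

    A trailing incomplete codon is ignored; a codon containing a
    character other than A, C, G or T raises KeyError.
    """
    dna = dna.upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)


def split_signal_peptide(protein: str, signal_length: int):
    """Split a precursor into (signal peptide, mature protein)."""
    return protein[:signal_length], protein[signal_length:]


if __name__ == "__main__":
    precursor = translate("ATGAAGATAAAGAAATAA")  # toy sequence
    print(precursor, split_signal_peptide(precursor, 2))
```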

On the map of pSJ2467 (Fig. 2), the α-amylase gene is located
between position 4.5 and 3.0, reading counterclockwise.






REFERENCES CITED IN THE SPECIFICATION 

S.L. Beaucage and M.H. Caruthers, Tetrahedron Letters 22, 1981,
pp. 1859-1869.

Brown et al., Applied and Environmental Microbiology, July
1990, pp. 1985-1991.

Chang, A. C. Y., Cohen, S. N. (1978). Construction and
characterization of amplifiable multicopy DNA cloning vehicles
derived from the P15A cryptic miniplasmid. J. Bacteriol., 134,
1141-1156.

Diderichsen, B., Wedsted, U., Hedegaard, L., Jensen, B. R.,
Sjøholm, C. (1990). Cloning of aldB, which encodes
α-acetolactate decarboxylase, an exoenzyme from Bacillus brevis. J.
Bacteriol., 172, 4315-4321.

Diderichsen, B. and Christiansen, L. Cloning of a maltogenic
α-amylase from Bacillus stearothermophilus, FEMS Microbiol. Lett.
56, 53-60, 1988.

Gryczan, T. et al. (1978). Characterization of Staphylococcus
aureus plasmids introduced by transformation into Bacillus
subtilis. J. Bacteriol., 134, 318-329.

Horinouchi et al. (1982b). "Nucleotide sequence and functional
map of pE194, a plasmid that specifies inducible resistance to
macrolide, lincosamide, and streptogramin type B antibiotics".
J. Bacteriol., 150, 804-814.

Hudson et al., 1989, Practical Immunology, Third edition
(1989), Blackwell Scientific Publications.

Kieser, T. (1984). Factors affecting the isolation of CCC DNA
from Streptomyces lividans and Escherichia coli. Plasmid 12,
19-36.






Koch et al., FEMS Microbiology Letters 71 (1990) 21-26.

Lipman and Pearson (1985), Science 227, 1435.

Matthes et al., EMBO J. 3, 1984, pp. 801-805.

Mountain, A. (1989). In "Biotechnology Handbooks Vol. 2.
Bacillus", Ed. Harwood, C. R., Plenum Press, New York, 1989, pp.
73-114.

Pitcher, D. G., Saunders, N. A., Owen, R. J. (1989). Rapid
extraction of bacterial genomic DNA with guanidium thiocyanate.
Lett. Appl. Microbiol., 8, 151-156.

Podkovyrov, Journ. of Bacteriol., 1992, Vol. 174, pp.
5400-5405.

R.K. Saiki et al., Science 239, 1988, pp. 487-491.

Sambrook et al., Molecular Cloning: A laboratory manual, 2nd
Ed., Cold Spring Harbor, 1989.

Svensson, B., FEBS Letters, 1988, Vol. 230, pp. 72-76.

Yanisch-Perron, C., Vieira, J., Messing, J. (1985). Improved M13
phage cloning vectors and host strains: Nucleotide sequences of
the M13mp18 and pUC19 vectors. Gene 33, 103-119.

Yasbin, R. E., Wilson, G. A., Young, F. E. (1975).
Transformation and transfection in lysogenic strains of Bacillus
subtilis: Evidence for selective induction of prophage in
competent cells. J. Bacteriol., 121, 296-304.

Zhou, FEBS Letters, 1989, Vol. 255, pp. 37-41.




SEQUENCE LISTING 

(1) GENERAL INFORMATION

(i) APPLICANT
(A) NAME: NOVO NORDISK A/S
(B) STREET: Novo Alle
(C) CITY: Bagsvaerd
(E) COUNTRY: DENMARK
(F) POSTAL CODE (ZIP): DK-2880
(G) TELEPHONE: +45 44448888
(H) TELEFAX: +45 4449 3256
(I) TELEX: 37304

(ii) TITLE OF INVENTION: An amylolytic enzyme
(iii) NUMBER OF SEQUENCES: 52

(iv) COMPUTER READABLE FORM
(A) MEDIUM TYPE: Floppy disk
(B) COMPUTER: IBM PC compatible
(C) OPERATING SYSTEM: PC-DOS/MS-DOS
(D) SOFTWARE: PatentIn Release #1.0, Version #1.25 (EPO)
Novo Nordisk A/S

(2) INFORMATION FOR SEQ ID NO 1
(i) SEQUENCE CHARACTERISTICS
(A) LENGTH: 1380 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA (genomic)
(vi) ORIGINAL SOURCE
(A) ORGANISM: Pyrococcus furiosus
(B) STRAIN: DSM 3638
(xi) SEQUENCE DESCRIPTION: SEQ ID NO 1

GIGAAGATAA AGAAATTAAC AOCXTTOCTA ACICTATTAC TUiTTlTJAT AGTACTAGCA 60 

AGTGCAGTAA GTGCAGCAAA ATAdTGGAG CITGAAGAGG GAGGAGTTAT AATGCAAGGA 120 

TTCTATTGGG AICTTOCAGG GGGAGGAATT TX3GIGQGATC ATATAAGATC GAAGATKCT 180 

GAAIGGTAIG AAGCT9GAAT CTCIGCAATA TGGCTAOCTC CAOCAAGCAA GGGGATGAGT 240 

GGAGGATATT CAATGGGCTA CX3AIODCTAT GATTACTTTG ATCI03GCGA CTACEAOCAG 300 

AAGGGAACIG TAGAGAOGOG TTTTGGATCA AAAGAAGAAC TAGIGAGATT GATACAAACT 360 

GCOGATQOCT AIGGAATAAA GGTAATOGOC GAIGTAGITA TAAAOCAGAG GGCTGGTGGT 420 

GACCTAGAAT GGAAOOXTT OGITGGAGAT TACACATOGA CEGACITTTC TAAAGITGOC 480 

TCAGGGAAAT ATACAGCTAA CTATCIGGAC TTXXAIOCAA AOGAGCITCA TTGTTGIGAC 540 

GAAGGAAOCT TTX3GAGGATT TOCAGATATA TCTCATCACA AAGAGTGGGA TCAGTACTGG 600 

CTATGGAAGA GCAATGAGAG TTAIGCIGCT TATTTAAGAA GCATAGGATT TX3ATQGTIGG 660 

AGATTTGACT ATCTTAAGQG CTATGGAGCT TGGGTIGICA GAGACTOGCT TAATTQGIGG 720 

GGAGGTTGGG CAGTIGGAGA GTACIGQGAC ACAAAT3TAG ATGCACTACT AAGCTQGGCA 780 

TAIGAGAGTC GIGCAAAGCT CTTIGACITC OCGCTCTACr ATAAAATGGA TGAAGCATTT 840 

GACAATAACA AGATTOCAGC ATTAGKTftT GOOCIACAAA AOQGACAAAC TGTAGITTOG 900 



AGAGATOCAT TTAAGGCAGT AACTITCCTT GOCAATCAIG ACACAGATAT AAIATQGAAC 
AAGIATOCAG CMMGCCTT CAIATIGACA TATGAGGGAC AGOCAGIAAT Al'ICilAGAGG 
GACTTIGAGG AAIGGCIGAA CAAGGATAAG CTAATEAACC TCATITGGAT CX2AIGATCAT 
TIGGCAGGAG GAAGCACAAC AATICTCTAC TAOGACAAOG AIGAGCIGAT ATITCIGAGA 
AATOGAGATT CEAGAAGGOC T3GGCTTA3A ACTIACA1TA ACTIGAGCOC TAACDQGCTT 
GGTAGGIGGG TATACGITOC AAAGITTGCA GQQGCT1U1A TICA1GAATA CACIGGAAAC 
CTAGGAGGAT GGGTAGATAA AAGAGTAGAT AGTAGOGGAT GGGTATAOCT AGAGGCAOCA 
OCTCAOGATC CAGCIAAOGG CTACIAIGGG TACTOOGTAT GGAGTEATIG IGGTEGITGGG 

(2) INFORMATION FOR SEQ ID NO 2 
(i) SEQUENCE CHARACTERISTICS

(A) LENGTH 303 base pairs

(B) TYPE nucleic acid

(C) STRANDEDNESS single

(D) TOPOLOGY linear

(ii) MOLECULE TYPE DNA (genomic)
(vi) ORIGINAL SOURCE

(A) ORGANISM Pyrococcus furiosus

(B) STRAIN DSM 3638

(xi) SEQUENCE DESCRIPTION SEQ ID NO 2



TCTAGAATCT CCATITCICA CAAATATCAG 
GCTTOCTOCT GOCAAATGAC TATQGATOCA 

ocamocrcA aagtoocict agaaiattac 

ATATGCIQGA TACITCITOC ATAITATATC 
TEAAATGGAT CTCTOGAAAC TACAGlTIUr 
GAA 



CICATCGTIG TOGTAGTAGA CAAi'lGTlUr 
AATCAGGTTA ATTAGCITAT OLTIUI'ICAG 
TOGCTGTOOC TCATAIGKA ATATGAAOGC 
TCTCTCAIGA ITGGCAAOGA AAGUACIGOC 
AAOCTITCTA GGGCATAGAC TAAITAGCIG 



(2) INFORMATION FOR SEQ ID NO 3 
(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH 117 base pairs 

(B) TYPE nucleic acid 

(C) STRANDEDNESS single

(D) TOPOLOGY linear

(ii) MOLECULE TYPE DNA (genomic)
(vi) ORIGINAL SOURCE

(A) ORGANISM Pyrococcus furiosus

(B) STRAIN DSM 3638 

(xi) SEQUENCE DESCRIPTION SEQ ID NO 3 

GATCA03IGA ACATAAAGAA ATEAAGACOC CICCTAACIC TATTACICTT TITEATAGIA 
CTAGCAAGTC CAGEAGIGCA GCAAAATACT TOGAGCITGA AGAGGGANGA GITATAA 

(2) INFORMATION FOR SEQ ID NO 4 
(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH 207 base pairs

(B) TYPE nucleic acid

(C) STRANDEDNESS single

(D) TOPOLOGY linear

(ii) MOLECULE TYPE DNA (genomic)
(vi) ORIGINAL SOURCE

(A) ORGANISM Pyrococcus furiosus

(B) STRAIN DSM 3638 

(xi) SEQUENCE DESCRIPTION SEQ ID NO 4 

TCTAGAAGGC CIGQGCITAT AACTTACATT AACITGAGOC CTAACIGGCT TGGIAGG7IGG 
GTATACTIOC AAAGITTGCA GGGGCITCTA TCAIGAATAC AOOGAAADCT AGGAGGATGG 
GAGAIAAAAG AGTAGATAGT AGOGGATCGG TATACCTAGA GGGAOCAOCT CAOGATOCAG 
CEAAOQGCTA CEA3X3GGTAC TOOC3EAT 

(2) INFORMATION FOR SEQ ID NO 5 
(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH 237 base pairs 

(B) TYPE nucleic acid 

(C) STRANDEDNESS single

(D) TOPOLOGY linear

(ii) MOLECULE TYPE DNA (genomic)
(vi) ORIGINAL SOURCE

(A) ORGANISM Pyrococcus furiosus

(B) STRAIN DSM 3638

(xi) SEQUENCE DESCRIPTION SEQ ID NO 5 

GATOCAAAGT CTTATCIOGA AAIGGCTAGA ACAAEAOGTC TGAAGAAATT GGGACATCIT 
TTGTT3ATATC AGTATGQGTA CITGATTAOG AAAATAAAAA GCICIACAGA GGATICACTA 
TACTGAATTA TGAAATCAAG G&CAIGAGAA AGGGGTTCAA AAAAATAGIT AAGGTAAATA 
TTCACAAACT AOOCIXXftGC GAACTTGGAT CTAATAGAAC AGCAIOGATT TCAAGGG 

(2) INFORMATION FOR SEQ ID NO 6 
(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH 192 base pairs 

(B) TYPE nucleic acid 

(C) STRANDEDNESS single

(D) TOPOLOGY linear

(ii) MOLECULE TYPE DNA (genomic)
(vi) ORIGINAL SOURCE

(A) ORGANISM Pyrococcus furiosus

(B) STRAIN DSM 3638 

(xi) SEQUENCE DESCRIPTION SEQ ID NO 6 

GATCAOGTCA ACAEAAAGAA ATTAACNOOC CTOCTAACTC TATTACTGTT TIT'IIATAGTA 
CTAGCAAGTC CACTAAGIGC AGCAAAATAC TTOGAGCTTG AAGAGGGAGG AGTTAEAATC 
C&AGGATTCT AUTGGGATCT TOCAGGNQGA GGATTTGGIG GGAICATATA AGATOGAAGA 
TTOCTGAATG GG 



(2) INFORMATION FOR SEQ ID NO 7
(i) SEQUENCE CHARACTERISTICS

(A) LENGTH 4679 base pairs

(B) TYPE nucleic acid

(C) STRANDEDNESS single

(D) TOPOLOGY linear

(ii) MOLECULE TYPE DNA (genomic)

(xi) SEQUENCE DESCRIPTION SEQ ID NO 7

GAAHTOCGGC OCAAOGATGG CTGA2TIO0G GGEIGAOQGC OQGOQGAAOC AAGGGGTCAT 60 

GGCT0GGO3G AAATCAAGGC CIGCX3GCEAG TGCQOGCXTT CICTITTCAG GAIEATAATC 120 

AGAGEAEATE GAAAGTTTCG CGATCT1TIC GEAIAATTCT TTEAGGGAEA GTCCAATOGA 180 

TAAGCITQGC TCCAGGIOGA O3GATO000G GGTAOOCA3T CEEATEEAGA AAAGCAAATC 240 

TAAAATEATC TCAAAAGQGA AIGAGAATAG TCAA3X3GAOC AATAAEAAIG ACEAGAGAAG 300 

AAAGAAIGAA GA3TCITCAT GAAATEAAGG AAOGAAEATE GGATAAATAT GQGGATCATC 360 

TEAAGGCEAT TOGIGITEAT GGCTCICTIG GDOCTCAGAC TCATOGGOOC TATKX3GA2A 420 

TTGAGA1GAT GIUIUTCATC TCAACAGAGG AAGCAGAdT CAGCXATCAA TGGACAAOOG 480 

GIGAGIGGAA GGIGGAAGIG AALL'iTlGAEA GOGAAGAGAT TCEACEAGAT TAIGCATCIC 540 

AGGIGGAATC AGATIGGOOG CTEACAGKIG GICAAT37TIT CICEATTITG 00GA3TEAIG 600 

ATTCAGGIGG ATACTEAGAG AAAGIGEATC AAACIGCTAA AK33GTAGAA GOOCAAAOGT 660 

TOCAOGAIGC GATTIGIGOC CITATOGTAG AAGAGCIGTT TGAAEAUGCA GGCAAAIGGC 720 

GTAATA3TOG TCIGCAAGGA OOGAGAACAT TICEAOCATC CTIGACIGEA CAGGIAGCAA 780 

TGGCAGGIGC CAICTTGATT GGICIGCATC A3XX3CATCIG TEAJEAOGAOG AGOGdTOGG 840 

TCTEAACIGA AGGAGTTAAG CAATCAGATC TTCXTTCAGG TEA1GAOCAT CIGIGOCAGE 900 

TOGEAATCTC TOGTCAACTT TODGACICIG AGAAACITCE GGAAIOGCEA GAGAATITCT 960 

GGAAIGGGAT TCAGGAGIGG AGAGAAOGAC A03GATATAT AGIGGATCIG TCAAAAOGCA 1020 

TACTAiTl'lG AAOGAIGAOC TCEAAIEAATE GTEAATCATC TIGGEEAOQG GGATOOCTOG 1080 

AOCIGCAGOC AAGCITATO3 ATEGCACEAT GOCEAAAACA ATEATAOGAA AAGATOGOGA 1140 

AACTITCAAT ATACICIGAT TATAATOCTC AAAACAGAAG GOOOGCACIC GOOGCAGGOC 1200 

TTCAITTOOG O0GAO0GATC AO00CTT Q CT T00GO0QGOC GTCAA00CX3G AAATCAGOCA 1260 

TCXJETGQGOC GGAAITAGAT CTAGCAIGCC TITEAGIOCA GAOCAAAATC CXTEAAOGIG 1320 
AGITTTOGflT OCACIGAGOG TCAGAGOOCT TAAEAAGATC ATCITCITCA GATO5ITTDG - 1380 

GTCIGOGCCT AATCICITGC TCTGAAAAOG AAAAAADOGC CITCGAGGGC GCJITITIOGA 1440 

AGGITCICIG AGCTAOCAAC TCITIGAAOC GAGGEAACIG GCT3X3GAGGA GOGCAGICAC 1500 

CAAAAdTGT OCHTICAGIT TAGOCEEAAC 03GOGCATCA CTTCAAGACE AACTOCTCEA 1560 

AATCAATEAC CAGIGGCEGC TCCEAGIQGE GCl'ITlGCAT GTCITiaOQG GITGGACrCA 1620 

AGAOGATAGE TAOOGGATAA GGOGCAGOQG TOQGACIGAA CGQGGGGTIC GTCCA3ACAG 1680 

TCCAGCITGG AGCEAACIGC CTAOOOQGAA CTCAGEGTCA GGOCTQGAAT GAGACAAAOG 1740 

OGGCXATAAC AGOGGAAIGA CAOOGGEAAA OOGAAAGGCA GGAACAGGAG AGOGCAOGAG 1800 

GGAGOOGOCA GGGGGAAAOG OCTGGEATCE TEATAGEOCE CTOQQGITIC GCX3\OCACIG 1860 

ASTEGAGOCT CAGATITOGE GATCCITCTC AGGGGGGCGG AGOCEATQGA AAAAOGGCIT 1920 

TC00GO3G0C CICICACTTC OCICTEAAGE ATCITOCTGG CATCTKXAG GAAATCTOCG 1980 

CCCCGTTCGT AAGOCATTIC OGCTOGOOGC ACTCGAAOGA CGGAGOGEAG OGAGTCAGTO 2040 

AGOGAGGAAG CX3GAATATAT GCIGEATCAC A3A3TCIGCE GAOGCAOOGG TGCAGOCEIT 2100 

TITCTOCTGC CACATCAAGC ACITCACTGA CAOCXTTCATC AdGOCAACA TAGEAAGOCA 2160 

GEATAGACTC CXKTCATITC ACITTEIGCA TECEAOGGAC TGCATAACTC ATAIGTAAAT 2220 

OGCTCXJiTlT TAGGIGGCAC AAATCIGAGG CATEITOGCE CITIOCGGCG AGGCEAGTEA 2280 

CXXITEAAGIT ATEGGEAIGA CIGGTITEEAA GOGCAAAAAA AGEIGCiTXT TOSEAOCEAT 2340 

TAATCEAIXDG TEAGAAAAOC GACIGEAAAA AGEAGAGTOG GCATEATCTC AEATEATAAA 2400 



WO 94/19454 



PCT/DK94/00071 




AGOCAGTCAT TAGGOCEATC TCACAMTOC TCAATAGACT TCAIAAACAA TOCIGCAIGA 2460 

TAAOCATCAC AAACAGAATC ATCTAOCTCT AAAGATAGCG GTAAATATAT TGAATTAOCT 2520 

TIATEAATGA AITTIOCIGC TCIAATAATC GCTAGAAGCT AATIACTATr MTATIGATA 2580 

TITAAGTIAA AOOCACTAAA TGAAGTOCAT GGAATAAIAG AAAGAGAAAA AGCATlTlCA 2640 

QGEATAGCTG TTTIGGGAAA CAA1T1UJUC GAAOCAITAT ATITCTCICAC ATCAGAAAGG 2700 

TAIAAATCAT AAAACTCTIT GAAGICATTC TITACAGGAG TOCAAATAOC AGAGAATGIT 2760 

TIAGATACAC CATCAAAAAT TGTAIAAACT GGCTCTAACT TA3O0CAAIA AOCTAACTCT 2820 

CDCnOGCTAT TGEAAOCAGT TCEAAAAGCT GTATITGAGT TEATCAOCCT TCICACTAAG 2880 

AAAATAAATG CAGGGIAAAA TITATATOCT TLTIUITI'IA TCECTOQGEA. TAAAACACEA 2940 

ATATCAAJXTT CTSTOGrTAT ACTAAAAGIC ClTlUriUGT TCAAAIAATC AITAAATATC 3000 

TC1T11CICT TOCAATIGIC TAAATCAA3T TTAITAAACT TCATITGATA TCOOTOCTAA 3060 

A1T1T1ATCT AAAGIGAATT TAGGAGGCTT ACTIGTCTGC Tl'lCI'lCATT AGAATCAATC 3120 

CTTTITTAAA AGICAATATT ACICTAACAT AAATATATAT TTEAAAAATA TOCCACTTTA 3180 

TCTAATATIC GTrCCTTAAT TPCAIGAACA. AICTICATTC TITCTICICT AGTCATTATT 3240 

AJT G GKX3C& GATCIGGTIG AACTACICIT TAATAAAATA A1T1T1UJUT TOXAASTOC 3300 

AGATIGGAAT AATAGAAAAT OCB2CTICAT OQGC1T1T1C GICATCATCT GTATCAATCA 3360 

AATOGCCTIC TTCItSDGfTCA TCAAGCHTTA ATITITTAIG TALLTICITIT AAGAAAOCAC 3420 

CATAGGAGAT TAAOCITTTA CX3GICTAAAC CTICCTOCAA ATCAGACAAA OSTTTCAAAT 3480 

TLTlTlCriC ATCATOQGTC ATAAAATOCG TA3XXTTEAC AGGAIAITIT GGAGTi'ltUl 1 3540 

CAAITCOOGA TICTATATCC GA3TTATA3T TAIiTl'l'lOGG TOGAAICATT TCAACITITA 3600 

CAITIGGATC ATAGICTAAT TTCAITGOCT TITTOCAAAA TIGAATOCAT lUlTlTlGAT 3660 

TCACCTAGIT TICTCTAITC TTAAAATAAG TTOGTTTOCAC ACATAOCAAT ACAIGCATCT 3720 

GCTGATTA3A AGAATEATCT TEATEATITA T1U1XACITC CCTTGCAOGC ATAAAAOCAA 3780 

CAAGATITIT ATIAAT1T1T TIMATIGCA TCATTOQGOG AAATOCITGA GCCATAICIG 3840 

ACAAACTCTr AITrAA!lTCr TCECO^TCAT AAAGALLTITI' AACIGITAAT GIGAGAAACA 3900 

AOC^AOGAAC TCITOGCTIT 1U1T1AATAA CITCAGGAAC AACCTITTGT GACIGAAIGC 3960 

CAIGITICAT TGCICTOCIC CAGTIGCACA TTGGACAAAG CXJDQGATTTA CAAAAOCACA 4020 

CHXGATACAA CTITCITTOG (XTIGTTICAC GAITITGITT AEACTCEAAT AITrCAGGAC 4080 

AATCTTITAC TCTITCAGOC Tl ' lT JA AATT CAAGAATATC CAGAAGITCA AAGEAATCAA 4140 

CATTAGOGAT I'l'lClTl'lCT CIOCA33QGTC TCALT1T1CC ACTI'ITIGTC TTCTCGACEA 4200 

AAACXXTPGA TlTl'lCATCT GAATAAATCC TACTATIAGG AGACATAATA TTAAAAGAAA 4260 

OOCOCATCIA TITAGITATr TCTTEAGTCA CTEATAACIT TAACAGATQG GCJ1T1T1C1G 4320 

TGCAACCAAT TTEAAGGGTT TTCAATACTT TAAAAGAGAT ACATAOCAAC ACTItAAOGC 4380 

AOCTTTCAGC AACIAAAATA AAAAIGAOGT TATTTCIATA TCTATCAAGA. TAAGAAAGAA 4440 

CAAGITCAAA AOCATCAAAA AAAGAGAOCT TITCAGGTCC TI'ITITISAIT TIAIAAACTC 4500 

ATIGGGIGAT CTOGACTIOG TTLTmTlT ACXTCKX3GT TATCAGITAG TICAAATIOG 4560 

TTLTl'lTIAG GITCEAAATC GIUITITICT TOGAATIGTC CIGTITTATC CTTTAOCTIG 4620 

TCTACAAAOC OCITAAAAAC GITITIA AAG GCT1T1AAGC OSICICTAOG TIOCTTAAG 4679 

(2) INFORMATION FOR SEQ ID NO 8
(i) SEQUENCE CHARACTERISTICS

(A) LENGTH 25 amino acids

(B) TYPE amino acid
(D) TOPOLOGY linear

(ii) MOLECULE TYPE protein

(xi) SEQUENCE DESCRIPTION SEQ ID NO 8

Met Asn Ile Lys Lys Leu Thr Pro Leu Leu Thr Leu Leu Leu Phe Phe
1 5 10 15

Ile Val Leu Ala Ser Pro Val Ser Ala
20 25



(2) INFORMATION FOR SEQ ID NO 9
(i) SEQUENCE CHARACTERISTICS

(A) LENGTH 435 amino acids

(B) TYPE amino acid
(D) TOPOLOGY linear

(ii) MOLECULE TYPE protein

(xi) SEQUENCE DESCRIPTION SEQ ID NO 9

Ala Lys Tyr Leu Glu Leu Glu Glu Gly Gly Val Ile Met Gln Ala Phe
1 5 10 15

Tyr Trp Asp Val Pro Gly Gly Gly Ile Trp Trp Asp His Ile Arg Ser
20 25 30

Lys Ile Pro Glu Trp Tyr Glu Ala Gly Ile Ser Ala Ile Trp Leu Pro
35 40 45

Pro Pro Ser Lys Gly Met Ser Gly Gly Tyr Ser Met Gly Tyr Asp Pro
50 55 60

Tyr Asp Tyr Phe Asp Leu Gly Glu Tyr Tyr Gln Lys Gly Thr Val Glu
65 70 75 80

Thr Arg Phe Gly Ser Lys Glu Glu Leu Val Arg Leu Ile Gln Thr Ala
85 90 95

His Ala Tyr Gly Ile Lys Val Ile Ala Asp Val Val Ile Asn His Arg
100 105 110

Ala Gly Gly Asp Leu Glu Trp Asn Pro Phe Val Gly Asp Tyr Thr Trp
115 120 125

Thr Asp Phe Ser Lys Val Ala Ser Gly Lys Tyr Thr Ala Asn Tyr Leu
130 135 140

Asp Phe His Pro Asn Glu Leu His Cys Cys Asp Glu Gly Thr Phe Gly
145 150 155 160

Gly Phe Pro Asp Ile Cys His His Lys Glu Trp Asp Gln Tyr Trp Leu
165 170 175

Trp Lys Ser Asn Glu Ser Tyr Ala Ala Tyr Leu Arg Ser Ile Gly Phe
180 185 190

Asp Gly Trp Arg Phe Asp Tyr Val Lys Gly Tyr Gly Ala Trp Val Val
195 200 205

Arg Asp Trp Leu Asn Trp Trp Gly Gly Trp Ala Val Gly Glu Tyr Trp
210 215 220

Asp Thr Asn Val Asp Ala Leu Leu Ser Trp Ala Tyr Glu Ser Gly Ala
225 230 235 240

Lys Val Phe Asp Phe Pro Leu Tyr Tyr Lys Met Asp Glu Ala Phe Asp
245 250 255

Asn Asn Asn Ile Pro Ala Leu Val Tyr Ala Leu Gln Asn Gly Gln Thr
260 265 270

Val Val Ser Arg Asp Pro Phe Lys Ala Val Thr Phe Val Ala Asn His
275 280 285

Asp Thr Asp Ile Ile Trp Asn Lys Tyr Pro Ala Tyr Ala Phe Ile Leu
290 295 300

Thr Tyr Glu Gly Gln Pro Val Ile Phe Tyr Arg Asp Phe Glu Glu Trp
305 310 315 320

Leu Asn Lys Asp Lys Leu Ile Asn Leu Ile Trp Ile His Asp His Leu
325 330 335

Ala Gly Gly Ser Thr Thr Ile Val Tyr Tyr Asp Asn Asp Glu Leu Ile
340 345 350

Phe Val Arg Asn Gly Asp Ser Arg Arg Pro Gly Leu Ile Thr Tyr Ile
355 360 365

Asn Leu Ser Pro Asn Trp Val Gly Arg Trp Val Tyr Val Pro Lys Phe
370 375 380

Ala Gly Ala Cys Ile His Glu Tyr Thr Gly Asn Leu Gly Gly Trp Val
385 390 395 400

Asp Lys Arg Val Asp Ser Ser Gly Trp Val Tyr Leu Glu Ala Pro Pro
405 410 415

His Asp Pro Ala Asn Gly Tyr Tyr Gly Tyr Ser Val Trp Ser Tyr Cys
420 425 430

Gly Val Gly
435

(2) INFORMATION FOR SEQ ID NO 10

(i) SEQUENCE CHARACTERISTICS

(A) LENGTH 10 amino acids 

(B) TYPE amino acid 
(D) TOPOLOGY linear 

(ii) MOLECULE TYPE protein 

(xi) SEQUENCE DESCRIPTION SEQ ID NO 10 

Ala Lys Tyr Leu Glu Leu Glu Glu Gly Gly
1 5 10

(2) INFORMATION FOR SEQ ID NO 11
(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH 10 amino acids 

(B) TYPE amino acid 
(D) TOPOLOGY linear 

(ii) MOLECULE TYPE protein 

(xi) SEQUENCE DESCRIPTION SEQ ID NO 11

Val Ile Met Gln Ala Phe Tyr Trp Asp Val
1 5 10

(2) INFORMATION FOR SEQ ID NO 12
(i) SEQUENCE CHARACTERISTICS

(A) LENGTH 10 amino acids

(B) TYPE amino acid 
(D) TOPOLOGY linear 

(ii) MOLECULE TYPE protein 

(xi) SEQUENCE DESCRIPTION SEQ ID NO 12 

Pro Gly Gly Gly Ile Trp Trp Asp His Ile
1 5 10 

(2) INFORMATION FOR SEQ ID NO 13 
(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH 10 amino acids

(B) TYPE amino acid 
(D) TOPOLOGY linear 

(ii) MOLECULE TYPE protein 

(xi) SEQUENCE DESCRIPTION SEQ ID NO 13 

Arg Ser Lys Ile Pro Glu Trp Tyr Glu Ala
1 5 10

(2) INFORMATION FOR SEQ ID NO 14
(i) SEQUENCE CHARACTERISTICS

(A) LENGTH 10 amino acids

(B) TYPE amino acid
(D) TOPOLOGY linear

(ii) MOLECULE TYPE protein

(xi) SEQUENCE DESCRIPTION SEQ ID NO 14

Gly Ile Ser Ala Ile Trp Leu Pro Pro Pro
1 5 10

(2) INFORMATION FOR SEQ ID NO 15
(i) SEQUENCE CHARACTERISTICS

(A) LENGTH 10 amino acids

(B) TYPE amino acid 
(D) TOPOLOGY linear 

(ii) MOLECULE TYPE protein 

(xi) SEQUENCE DESCRIPTION SEQ ID NO 15 

Ser Lys Gly Met Ser Gly Gly Tyr Ser Met 
1 5 10

(2) INFORMATION FOR SEQ ID NO 16 

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH 10 amino acids 

(B) TYPE amino acid 
(D) TOPOLOGY linear 

(ii) MOLECULE TYPE protein 

(xi) SEQUENCE DESCRIPTION SEQ ID NO 16 

Gly Tyr Asp Pro Tyr Asp Tyr Phe Asp Leu 
1 5 10 

(2) INFORMATION FOR SEQ ID NO 17 
(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH 10 amino acids

(B) TYPE amino acid
(D) TOPOLOGY linear

(ii) MOLECULE TYPE protein

(xi) SEQUENCE DESCRIPTION SEQ ID NO 17

Gly Glu Tyr Tyr Gln Lys Gly Thr Val Glu
1 5 10

(2) INFORMATION FOR SEQ ID NO 18
(i) SEQUENCE CHARACTERISTICS

(A) LENGTH 10 amino acids

(B) TYPE amino acid
(D) TOPOLOGY linear

(ii) MOLECULE TYPE protein

(xi) SEQUENCE DESCRIPTION SEQ ID NO 18

Thr Arg Phe Gly Ser Lys Glu Glu Leu Val
1 5 10

(2) INFORMATION FOR SEQ ID NO 19
(i) SEQUENCE CHARACTERISTICS

(A) LENGTH 10 amino acids

(B) TYPE amino acid 
(D) TOPOLOGY linear 

(ii) MOLECULE TYPE protein 

(xi) SEQUENCE DESCRIPTION SEQ ID NO 19 

Arg Leu Ile Gln Thr Ala His Ala Tyr Gly
1 5 10

(2) INFORMATION FOR SEQ ID NO 20
(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH 10 amino acids 

(B) TYPE amino acid 
(D) TOPOLOGY linear 

(ii) MOLECULE TYPE protein 

(xi) SEQUENCE DESCRIPTION SEQ ID NO 20

Ile Lys Val Ile Ala Asp Val Val Ile Asn
1 5 10

(2) INFORMATION FOR SEQ ID NO 21
(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH 10 amino acids 

(B) TYPE amino acid 
(D) TOPOLOGY linear 

(ii) MOLECULE TYPE protein

(xi) SEQUENCE DESCRIPTION SEQ ID NO 21

His Arg Ala Gly Gly Asp Leu Glu Trp Asn
1 5 10

(2) INFORMATION FOR SEQ ID NO 22
(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH 10 amino acids 

(B) TYPE amino acid 
(D) TOPOLOGY linear 

(ii) MOLECULE TYPE protein

(xi) SEQUENCE DESCRIPTION SEQ ID NO 22

Pro Phe Val Gly Asp Tyr Thr Trp Thr Asp
1 5 10

(2) INFORMATION FOR SEQ ID NO 23
(i) SEQUENCE CHARACTERISTICS

(A) LENGTH 10 amino acids 

(B) TYPE amino acid 
(D) TOPOLOGY linear 

(ii) MOLECULE TYPE protein 

(xi) SEQUENCE DESCRIPTION SEQ ID NO 23

Phe Ser Lys Val Ala Ser Gly Lys Tyr Thr
1 5 10



(2) INFORMATION FOR SEQ ID NO 24
(i) SEQUENCE CHARACTERISTICS

(A) LENGTH 10 amino acids

(B) TYPE amino acid 
(D) TOPOLOGY linear 

(ii) MOLECULE TYPE protein 

(xi) SEQUENCE DESCRIPTION SEQ ID NO 24

Ala Asn Tyr Leu Asp Phe His Pro Asn Glu
1 5 10



(2) INFORMATION FOR SEQ ID NO 25 
(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH 10 amino acids

(B) TYPE amino acid 
(D) TOPOLOGY linear 

(ii) MOLECULE TYPE protein 

(xi) SEQUENCE DESCRIPTION SEQ ID NO 25 

Leu His Cys Cys Asp Glu Gly Thr Phe Gly
1 5 10



(2) INFORMATION FOR SEQ ID NO 26 
(i) SEQUENCE CHARACTERISTICS

(A) LENGTH 10 amino acids

(B) TYPE amino acid
(D) TOPOLOGY linear

(ii) MOLECULE TYPE protein

(xi) SEQUENCE DESCRIPTION SEQ ID NO 26

Gly Phe Pro Asp Ile Cys His His Lys Glu
1 5 10



(2) INFORMATION FOR SEQ ID NO 27 
(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH 10 amino acids

(B) TYPE amino acid
(D) TOPOLOGY linear

(ii) MOLECULE TYPE protein

(xi) SEQUENCE DESCRIPTION SEQ ID NO 27

Trp Asp Gln Tyr Trp Leu Trp Lys Ser Asn
1 5 10

(2) INFORMATION FOR SEQ ID NO 28
(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH 10 amino acids 

(B) TYPE amino acid 
(D) TOPOLOGY linear 

(ii) MOLECULE TYPE protein 

(xi) SEQUENCE DESCRIPTION SEQ ID NO 28 

Glu Ser Tyr Ala Ala Tyr Leu Arg Ser Ile
1 5 10 

(2) INFORMATION FOR SEQ ID NO 29
(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH 10 amino acids 

(B) TYPE amino acid 
(D) TOPOLOGY linear 

(ii) MOLECULE TYPE protein 

(xi) SEQUENCE DESCRIPTION SEQ ID NO 29 

Gly Phe Asp Gly Trp Arg Phe Asp Tyr Val
1 5 10

(2) INFORMATION FOR SEQ ID NO 30 
(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH 10 amino acids

(B) TYPE amino acid 
(D) TOPOLOGY linear 

(ii) MOLECULE TYPE protein 

(xi) SEQUENCE DESCRIPTION SEQ ID NO 30 

Lys Gly Tyr Gly Ala Trp Val Val Arg Asp 
1 5 10

(2) INFORMATION FOR SEQ ID NO 31
(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH 10 amino acids 

(B) TYPE amino acid 
(D) TOPOLOGY linear 

(ii) MOLECULE TYPE protein 

(xi) SEQUENCE DESCRIPTION SEQ ID NO 31

Trp Leu Asn Trp Trp Gly Gly Trp Ala Val
1 5 10



(2) INFORMATION FOR SEQ ID NO 32
(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH 10 amino acids 

(B) TYPE amino acid 
(D) TOPOLOGY linear 

(ii) MOLECULE TYPE protein 

(xi) SEQUENCE DESCRIPTION SEQ ID NO 32

Gly Glu Tyr Trp Asp Thr Asn Val Asp Ala
1 5 10 

(2) INFORMATION FOR SEQ ID NO 33
(i) SEQUENCE CHARACTERISTICS

(A) LENGTH 10 amino acids

(B) TYPE amino acid
(D) TOPOLOGY linear

(ii) MOLECULE TYPE protein

(xi) SEQUENCE DESCRIPTION SEQ ID NO 33

Leu Leu Ser Trp Ala Tyr Glu Ser Gly Ala
1 5 10



(2) INFORMATION FOR SEQ ID NO 34
(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH 10 amino acids 

(B) TYPE amino acid 
(D) TOPOLOGY linear 

(ii) MOLECULE TYPE protein 

(xi) SEQUENCE DESCRIPTION SEQ ID NO 34

Lys Val Phe Asp Phe Pro Leu Tyr Tyr Lys
1 5 10



(2) INFORMATION FOR SEQ ID NO 35
(i) SEQUENCE CHARACTERISTICS

(A) LENGTH 10 amino acids

(B) TYPE amino acid
(D) TOPOLOGY linear

(ii) MOLECULE TYPE protein

(xi) SEQUENCE DESCRIPTION SEQ ID NO 35

Met Asp Glu Ala Phe Asp Asn Asn Asn Ile
1 5 10

(2) INFORMATION FOR SEQ ID NO 36
(i) SEQUENCE CHARACTERISTICS

(A) LENGTH 10 amino acids

(B) TYPE amino acid 
(D) TOPOLOGY linear 

(ii) MOLECULE TYPE protein 

(xi) SEQUENCE DESCRIPTION SEQ ID NO 36 

Pro Ala Leu Val Tyr Ala Leu Gln Asn Gly
1 5 10



(2) INFORMATION FOR SEQ ID NO 37 
(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH 10 amino acids

(B) TYPE amino acid
(D) TOPOLOGY linear

(ii) MOLECULE TYPE protein

(xi) SEQUENCE DESCRIPTION SEQ ID NO 37

Gln Thr Val Val Ser Arg Asp Pro Phe Lys
1 5 10

(2) INFORMATION FOR SEQ ID NO 38 
(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH 10 amino acids 

(B) TYPE amino acid 
(D) TOPOLOGY linear 

(ii) MOLECULE TYPE protein 

(xi) SEQUENCE DESCRIPTION SEQ ID NO 38 

Ala Val Thr Phe Val Ala Asn His Asp Thr
1 5 10

(2) INFORMATION FOR SEQ ID NO 39 
(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH 10 amino acids 

(B) TYPE amino acid 
(D) TOPOLOGY linear 

(ii) MOLECULE TYPE protein

(xi) SEQUENCE DESCRIPTION SEQ ID NO 39

Asp Ile Ile Trp Asn Lys Tyr Pro Ala Tyr
1 5 10

(2) INFORMATION FOR SEQ ID NO 40
(i) SEQUENCE CHARACTERISTICS

(A) LENGTH 10 amino acids

(B) TYPE amino acid
(D) TOPOLOGY linear

(ii) MOLECULE TYPE protein

(xi) SEQUENCE DESCRIPTION SEQ ID NO 40

Ala Phe Ile Leu Thr Tyr Glu Gly Gln Pro
1 5 10



(2) INFORMATION FOR SEQ ID NO 41
(i) SEQUENCE CHARACTERISTICS

(A) LENGTH 10 amino acids

(B) TYPE amino acid
(D) TOPOLOGY linear

(ii) MOLECULE TYPE protein

(xi) SEQUENCE DESCRIPTION SEQ ID NO 41

Val Ile Phe Tyr Arg Asp Phe Glu Glu Trp
1 5 10

(2) INFORMATION FOR SEQ ID NO 42
(i) SEQUENCE CHARACTERISTICS

(A) LENGTH 10 amino acids

(B) TYPE amino acid
(D) TOPOLOGY linear

(ii) MOLECULE TYPE protein

(xi) SEQUENCE DESCRIPTION SEQ ID NO 42

Leu Asn Lys Asp Lys Leu Ile Asn Leu Ile
1 5 10



(2) INFORMATION FOR SEQ ID NO 43
(i) SEQUENCE CHARACTERISTICS

(A) LENGTH 10 amino acids

(B) TYPE amino acid
(D) TOPOLOGY linear

(ii) MOLECULE TYPE protein

(xi) SEQUENCE DESCRIPTION SEQ ID NO 43

Trp Ile His Asp His Leu Ala Gly Gly Ser
1 5 10

(2) INFORMATION FOR SEQ ID NO 44 
(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH 10 amino acids

(B) TYPE amino acid
(D) TOPOLOGY linear

(ii) MOLECULE TYPE protein

(xi) SEQUENCE DESCRIPTION SEQ ID NO 44

Thr Thr Ile Val Tyr Tyr Asp Asn Asp Glu
1 5 10 

(2) INFORMATION FOR SEQ ID NO 45
(i) SEQUENCE CHARACTERISTICS

(A) LENGTH 10 amino acids

(B) TYPE amino acid
(D) TOPOLOGY linear

(ii) MOLECULE TYPE protein

(xi) SEQUENCE DESCRIPTION SEQ ID NO 45

Leu Ile Phe Val Arg Asn Gly Asp Ser Arg
1 5 10

(2) INFORMATION FOR SEQ ID NO 46
(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH 10 amino acids 

(B) TYPE amino acid 
(D) TOPOLOGY linear 

(ii) MOLECULE TYPE protein

(xi) SEQUENCE DESCRIPTION SEQ ID NO 46

Arg Pro Gly Leu Ile Thr Tyr Ile Asn Leu
1 5 10

(2) INFORMATION FOR SEQ ID NO 47
(i) SEQUENCE CHARACTERISTICS

(A) LENGTH 10 amino acids

(B) TYPE amino acid 
(D) TOPOLOGY linear 

(ii) MOLECULE TYPE protein 

(xi) SEQUENCE DESCRIPTION SEQ ID NO 47

Ser Pro Asn Trp Val Gly Arg Trp Val Tyr
1 5 10

(2) INFORMATION FOR SEQ ID NO 48
(i) SEQUENCE CHARACTERISTICS

(A) LENGTH 10 amino acids

(B) TYPE amino acid
(D) TOPOLOGY linear

(ii) MOLECULE TYPE protein

(xi) SEQUENCE DESCRIPTION SEQ ID NO 48

Val Pro Lys Phe Ala Gly Ala Cys Ile His
1 5 10

(2) INFORMATION FOR SEQ ID NO 49 
(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH 10 amino acids

(B) TYPE amino acid
(D) TOPOLOGY linear

(ii) MOLECULE TYPE protein

(xi) SEQUENCE DESCRIPTION SEQ ID NO 49

Glu Tyr Thr Gly Asn Leu Gly Gly Trp Val
1 5 10

(2) INFORMATION FOR SEQ ID NO 50 
(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH 10 amino acids

(B) TYPE amino acid
(D) TOPOLOGY linear

(ii) MOLECULE TYPE protein

(xi) SEQUENCE DESCRIPTION SEQ ID NO 50

Asp Lys Arg Val Asp Ser Ser Gly Trp Val
1 5 10

(2) INFORMATION FOR SEQ ID NO 51 
(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH 10 amino acids

(B) TYPE amino acid 
(D) TOPOLOGY linear 

(ii) MOLECULE TYPE protein 

(xi) SEQUENCE DESCRIPTION SEQ ID NO 51 

Tyr Leu Glu Ala Pro Pro His Asp Pro Ala 
1 5 10

(2) INFORMATION FOR SEQ ID NO 52
(i) SEQUENCE CHARACTERISTICS

(A) LENGTH 15 amino acids

(B) TYPE amino acid 
(D) TOPOLOGY linear 

(ii) MOLECULE TYPE protein 

(xi) SEQUENCE DESCRIPTION SEQ ID NO 52 

Asn Gly Tyr Tyr Gly Tyr Ser Val Trp Ser Tyr Cys Gly Val Gly 
1 5 10 15
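
The partial sequences of SEQ ID NOs 10-52 appear to be consecutive fragments of the 435-residue sequence of SEQ ID NO 9, and the claims cite them in one-letter code. As a minimal illustrative sketch (not part of the original specification), the following Python snippet converts the three-letter notation of the listing to one-letter code and locates the first three fragments within the first 32 residues of SEQ ID NO 9. The helper names `to_one_letter` and `find_fragment` are hypothetical and introduced here only for illustration.

```python
# Standard IUPAC three-letter to one-letter amino acid codes.
THREE_TO_ONE = {
    "Ala": "A", "Arg": "R", "Asn": "N", "Asp": "D", "Cys": "C",
    "Gln": "Q", "Glu": "E", "Gly": "G", "His": "H", "Ile": "I",
    "Leu": "L", "Lys": "K", "Met": "M", "Phe": "F", "Pro": "P",
    "Ser": "S", "Thr": "T", "Trp": "W", "Tyr": "Y", "Val": "V",
}


def to_one_letter(three_letter_seq: str) -> str:
    """Convert a space-separated three-letter sequence to one-letter code."""
    return "".join(THREE_TO_ONE[res] for res in three_letter_seq.split())


def find_fragment(full_seq: str, fragment: str) -> int:
    """Return the 1-based start position of fragment in full_seq, or 0 if absent."""
    return full_seq.find(fragment) + 1


# First 32 residues of SEQ ID NO 9 (hypothetical variable name).
seq9_start = to_one_letter(
    "Ala Lys Tyr Leu Glu Leu Glu Glu Gly Gly Val Ile Met Gln Ala Phe "
    "Tyr Trp Asp Val Pro Gly Gly Gly Ile Trp Trp Asp His Ile Arg Ser"
)

# Partial sequences SEQ ID NOs 10, 11 and 12 from the listing.
seq10 = to_one_letter("Ala Lys Tyr Leu Glu Leu Glu Glu Gly Gly")
seq11 = to_one_letter("Val Ile Met Gln Ala Phe Tyr Trp Asp Val")
seq12 = to_one_letter("Pro Gly Gly Gly Ile Trp Trp Asp His Ile")

for name, frag in [("SEQ ID NO 10", seq10),
                   ("SEQ ID NO 11", seq11),
                   ("SEQ ID NO 12", seq12)]:
    print(name, frag, "starts at residue", find_fragment(seq9_start, frag))
```

Under these assumptions the three fragments start at residues 1, 11 and 21, which is consistent with the decapeptides tiling SEQ ID NO 9 without gaps.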
CLAIMS

1. A DNA construct comprising a DNA sequence encoding a
Pyrococcus α-amylase or a variant thereof having α-amylase
activity and/or being immunologically cross-reactive with a
Pyrococcus α-amylase, said DNA sequence

i) comprises a partial DNA sequence as shown in SEQ ID Nos.
2, 3, 4, 5 and/or 6 or an analogue of said partial sequence
capable of hybridizing with an oligonucleotide probe prepared
on the basis of the DNA sequence shown in SEQ ID No. 2, 3, 4,
5 and/or 6, or

ii) corresponds to a genomic Pyrococcus DNA sequence located
within 5 kb of a genomic DNA sequence hybridizing with an
oligonucleotide probe prepared on the basis of the DNA
sequence shown in SEQ ID No. 2, 3, 4, 5 and/or 6, or

iii) comprises the DNA sequence shown in SEQ ID No. 1 or an
analogue of said sequence capable of hybridizing with an
oligonucleotide probe prepared on the basis of the DNA
sequence shown in SEQ ID No. 1.

2. The DNA construct according to claim 1, in which the DNA
sequence encoding the Pyrococcus α-amylase or a variant
thereof corresponds to a genomic Pyrococcus DNA fragment
located between and optionally comprising the partial DNA
sequences identified in the appended SEQ ID Nos. 5 and 6 or
analogues thereof capable of hybridizing with an
oligonucleotide probe prepared on the basis of the DNA sequence
shown in SEQ ID No. 5 and 6, respectively.

3. A DNA construct comprising a Pyrococcus DNA sequence
hybridizing with a DNA sequence having any of the properties
i)-iii) as defined in claim 1.

4. A DNA construct encoding an enzyme exhibiting amylolytic
activity and

a) comprising a DNA sequence encoding at least one of the
partial amino acid sequences

(a) AKYLELEEGG (SEQ ID NO 10);  (b) VIMQAFYWDV (SEQ ID NO 11);
(c) PGGGIWWDHI (SEQ ID NO 12);  (d) RSKIPEWYEA (SEQ ID NO 13);
(e) GISAIWLPPP (SEQ ID NO 14);  (f) SKGMSGGYSM (SEQ ID NO 15);
(g) GYDPYDYFDL (SEQ ID NO 16);  (h) GEYYQKGTVE (SEQ ID NO 17);
(i) TRFGSKEELV (SEQ ID NO 18);  (j) RLIQTAHAYG (SEQ ID NO 19);
(k) IKVIADVVIN (SEQ ID NO 20);  (l) HRAGGDLEWN (SEQ ID NO 21);
(m) PFVGDYTWTD (SEQ ID NO 22);  (n) FSKVASGKYT (SEQ ID NO 23);
(o) ANYLDFHPNE (SEQ ID NO 24);  (p) LHCCDEGTFG (SEQ ID NO 25);
(q) GFPDICHHKE (SEQ ID NO 26);  (r) WDQYWLWKSN (SEQ ID NO 27);
(s) ESYAAYLRSI (SEQ ID NO 28);  (t) GFDGWRFDYV (SEQ ID NO 29);
(u) KGYGAWVVRD (SEQ ID NO 30);  (v) WLNWWGGWAV (SEQ ID NO 31);
(x) GEYWDTNVDA (SEQ ID NO 32);  (y) LLSWAYESGA (SEQ ID NO 33);
(z) KVFDFPLYYK (SEQ ID NO 34);  (A) MDEAFDNNNI (SEQ ID NO 35);
(B) PALVYALQNG (SEQ ID NO 36);  (C) QTVVSRDPFK (SEQ ID NO 37);
(D) AVTFVANHDT (SEQ ID NO 38);  (E) DIIWNKYPAY (SEQ ID NO 39);
(F) AFILTYEGQP (SEQ ID NO 40);  (G) VIFYRDFEEW (SEQ ID NO 41);
(H) LNKDKLINLI (SEQ ID NO 42);  (I) WIHDHLAGGS (SEQ ID NO 43);
(J) TTIVYYDNDE (SEQ ID NO 44);  (K) LIFVRNGDSR (SEQ ID NO 45);
(L) RPGLITYINL (SEQ ID NO 46);  (M) SPNWVGRWVY (SEQ ID NO 47);
(N) VPKFAGACIH (SEQ ID NO 48);  (O) EYTGNLGGWV (SEQ ID NO 49);
(P) DKRVDSSGWV (SEQ ID NO 50);  (Q) YLEAPPHDPA (SEQ ID NO 51);
(R) NGYYGYSVWSYCGVG (SEQ ID NO 52), and/or

b) comprises a DNA sequence hybridizing with an
oligonucleotide probe prepared on the basis of any of the DNA
sequences shown in SEQ ID Nos. 1-6, on the basis of the amino
acid sequence encoded by any of the said DNA sequences or the
amino acid sequence shown in SEQ ID No. 9, or on the basis of
any of the partial amino acid sequences (a)-(R) listed in a)
above, and/or

c) encodes a polypeptide being at least 70% homologous with
the amino acid sequence shown in SEQ ID No. 9.

5. A DNA construct according to claim 4, in which the enzyme
exhibiting amylolytic activity is an α-amylase, in particular
a Pyrococcus α-amylase or a variant thereof having α-amylase
activity.

6. A DNA construct according to any of claims 1-5, in which
the DNA sequence is derivable from a thermophilic
archaebacterium.

7. The DNA construct according to claim 6, in which the DNA
sequence is derivable from a strain of Pyrococcus woesei or
from a strain of Pyrococcus furiosus.

8. The DNA construct according to claim 7, in which the DNA
sequence is derivable from the Pyrococcus woesei strain DSM
3773 or the Pyrococcus furiosus strain DSM 3638, or from a
mutant or derivative of any of these strains having an
α-amylase producing capability.

9. A vector harbouring a DNA construct according to any of
claims 1-8.

10. The vector according to claim 9, which is a plasmid or a
bacteriophage.

11. The vector according to claim 9 or 10, which is an
expression vector further comprising DNA sequences permitting
expression of the amylolytic enzyme, such as a Pyrococcus
α-amylase or variant thereof.

12. A host cell harbouring a DNA construct according to any
of claims 1-8 or a vector according to any of claims 9-11.

13. The host cell according to claim 12, which is a
microorganism.

14. The host cell according to claim 13, which is a bacterium
or a fungus.

15. The host cell according to claim 14, which is a
gram-positive bacterium such as Bacillus subtilis, Bacillus
licheniformis, Bacillus lentus, Bacillus brevis, Bacillus
stearothermophilus, Bacillus alkalophilus, Bacillus
amyloliquefaciens, Bacillus coagulans, Bacillus circulans,
Bacillus lautus, Bacillus thuringiensis or Streptomyces
lividans or Streptomyces murinus, or a gram-negative bacterium
such as E. coli.

16. A Bacillus cell, which is different from a cell of
Bacillus licheniformis, and which harbours a DNA construct
comprising a DNA sequence encoding a Pyrococcus α-amylase or a
variant thereof.

17. The Bacillus cell according to claim 16, in which the DNA
sequence encoding the Pyrococcus α-amylase or a variant
thereof is derivable from a strain of Pyrococcus woesei or
from a strain of Pyrococcus furiosus.

18. The Bacillus cell according to claim 17, in which the DNA
sequence is derivable from the Pyrococcus woesei strain DSM
3773 or the Pyrococcus furiosus strain DSM 3638 or a
derivative or mutant of any of these strains capable of
producing α-amylase activity.

19. The Bacillus cell according to claim 16, which is derived
from Bacillus subtilis, Bacillus lentus, Bacillus brevis,
Bacillus stearothermophilus, Bacillus alkalophilus, Bacillus
amyloliquefaciens, Bacillus coagulans, Bacillus circulans,
Bacillus lautus or Bacillus thuringiensis.

20. A process for producing an amylolytic enzyme, in
particular a Pyrococcus α-amylase or a variant thereof,
comprising culturing a cell according to any of claims 12-19
in a suitable culture medium under conditions permitting
expression of the amylolytic enzyme, and recovering the
resulting amylolytic enzyme from the culture.

21. An amylolytic enzyme, in particular a Pyrococcus
α-amylase or a variant thereof having α-amylase activity,
produced by the process according to claim 20.

22. A Pyrococcus α-amylase comprising the amino acid sequence
shown in SEQ ID No. 9 or a variant thereof having α-amylase
activity and/or being immunologically cross-reactive with the
α-amylase comprising the amino acid sequence shown in SEQ ID
No. 9 and/or comprising an amino acid sequence being at least
70% homologous to the amino acid sequence shown in SEQ ID No.
9.

23. An amylolytic enzyme which

a) comprises at least one of the following partial sequences

(a) AKYLELEEGG (SEQ ID NO 10);
(c) PGGGIWWDHI (SEQ ID NO 12);
(e) GISAIWLPPP (SEQ ID NO 14);
(g) GYDPYDYFDL (SEQ ID NO 16);
(i) TRFGSKEELV (SEQ ID NO 18);
(k) IKVIADWIN (SEQ ID NO 20);
(m) PFVGDYTWTD (SEQ ID NO 22);
(o) ANYLDFHPNE (SEQ ID NO 24);
(q) GFPDICHHKE (SEQ ID NO 26);
(s) ESYAAYLRSI (SEQ ID NO 28);
(u) KGYGAWWRD (SEQ ID NO 30);
(x) GEYWDTNVDA (SEQ ID NO 32);
(z) KVFDFPLYYK (SEQ ID NO 34);
(B) PALVYALQNG (SEQ ID NO 36);
(D) AVTFVANHDT (SEQ ID NO 38);
(F) AFILTYEGQP (SEQ ID NO 40);
(H) LNKDKLINLI (SEQ ID NO 42);
(J) TTIVYYDNDE (SEQ ID NO 44);
(L) RPGLITYINL (SEQ ID NO 46);
(N) VPKFAGACIH (SEQ ID NO 48);
(P) DKRVDSSGWV (SEQ ID NO 50);
(R) NGYYGYSVWSYCGVG (SEQ ID NO 52),

[partial sequences (b), (d), (f), ..., (Q), i.e. the odd-numbered SEQ ID NOs 11 to 51, are not legible in the source]

and/or

b) is encoded by a DNA sequence hybridizing with an oligonucleotide
probe prepared on the basis of any of the DNA sequences
shown in SEQ ID Nos. 1-6, on the basis of the amino acid
sequence encoded by any of the said DNA sequences or the amino
acid sequence shown in SEQ ID No. 9, or on the basis of any of
the partial amino acid sequences (a)-(R) listed in a) above,
and/or

c) is at least 70% homologous with the amino acid sequence
shown in SEQ ID No. 9.
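
Criteria a) and c) of claim 23 can be illustrated with a minimal Python sketch. It is not part of the claimed subject matter. The function names are hypothetical, only a few of the listed partial sequences are included, and "homologous" is taken here to mean percent identity over a pre-computed alignment, which is an assumption; the specification may define homology differently.

```python
# A few of the partial sequences listed in claim 23 a), copied as given there.
PARTIAL_SEQUENCES = {
    "SEQ ID NO 10": "AKYLELEEGG",
    "SEQ ID NO 14": "GISAIWLPPP",
    "SEQ ID NO 16": "GYDPYDYFDL",
    "SEQ ID NO 52": "NGYYGYSVWSYCGVG",
}


def matching_partial_sequences(candidate):
    """Return the IDs of the listed partial sequences found in `candidate`."""
    candidate = candidate.upper()
    return [seq_id for seq_id, motif in PARTIAL_SEQUENCES.items()
            if motif in candidate]


def percent_identity(aligned_a, aligned_b):
    """Percent identity of two pre-aligned, equal-length sequences.

    '-' marks a gap. Columns where both sequences have a gap are ignored.
    ASSUMPTION: identity over the alignment stands in for "homology".
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    columns = [(x, y) for x, y in zip(aligned_a, aligned_b)
               if not (x == "-" and y == "-")]
    if not columns:
        return 0.0
    matches = sum(1 for x, y in columns if x == y and x != "-")
    return 100.0 * matches / len(columns)


def meets_homology_threshold(aligned_a, aligned_b, threshold=70.0):
    """True if the aligned pair reaches the 70% figure used in criterion c)."""
    return percent_identity(aligned_a, aligned_b) >= threshold
```

In practice the alignment itself would come from a dedicated alignment tool; the sketch only covers the bookkeeping once an alignment exists.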

24. An amylolytic enzyme according to claim 23, in which the
enzyme is a Pyrococcus α-amylase or a variant thereof having
α-amylase activity.

25. A starch liquefaction process which comprises subjecting
an aqueous starch slurry to enzymatic liquefaction in the
presence of a Pyrococcus α-amylase or a variant thereof
according to any of claims 21-24.

26. The starch liquefaction process according to claim 25, in 
which the process is performed essentially without addition 
of a calcium salt to the starch slurry. 

27. The starch liquefaction process according to claim 25 or
26, in which the process is conducted by jet-cooking at a
temperature in the range of 100 to 140°C for up to 120 minutes,
optionally followed by reduction of the temperature to
be held in the range of 90 to 100°C for about 30 to 120 minutes,
after which the thus liquefied starch is stable against
retrogradation, the pH being held at about 4.0 to 5.5 throughout
the process.
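
The operating ranges recited in claim 27 can be summarised as a simple range check. The following Python sketch is illustrative only and not part of the claimed subject matter. The class and function names are hypothetical, and the "about" qualifiers of the claim are treated as hard bounds, which is a simplifying assumption.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class LiquefactionRun:
    """Hypothetical record of one liquefaction run."""
    jet_temp_c: float                      # jet-cooking temperature, deg C
    jet_minutes: float                     # jet-cooking time, minutes
    ph: float                              # pH held throughout the process
    hold_temp_c: Optional[float] = None    # optional holding step, deg C
    hold_minutes: Optional[float] = None   # optional holding time, minutes


def out_of_range(run: LiquefactionRun) -> List[str]:
    """List the parameters that fall outside the ranges recited in claim 27.

    ASSUMPTION: "about" is read as a hard bound.
    """
    problems = []
    if not 100 <= run.jet_temp_c <= 140:
        problems.append("jet-cooking temperature outside 100-140 deg C")
    if not 0 < run.jet_minutes <= 120:
        problems.append("jet-cooking time outside 0-120 min")
    if not 4.0 <= run.ph <= 5.5:
        problems.append("pH outside 4.0-5.5")
    # The holding step is optional; check it only when a temperature is given.
    if run.hold_temp_c is not None:
        if not 90 <= run.hold_temp_c <= 100:
            problems.append("holding temperature outside 90-100 deg C")
        if run.hold_minutes is None or not 30 <= run.hold_minutes <= 120:
            problems.append("holding time outside 30-120 min")
    return problems
```

An empty list means every recorded parameter lies within the recited ranges; for example, a run jet-cooked at 105°C for 5 minutes at pH 4.5 with no holding step returns no problems.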

28. The starch liquefaction process according to any of
claims 25-27, whereby the liquefied starch is subjected to
enzymatic saccharification in the presence of a glucoamylase,
substantially without an intermediate pH adjustment.

29. The starch liquefaction process according to claim 28,
further comprising ethanol fermentation with yeast simultaneously
with or subsequent to said saccharification.

30. A starch liquefaction method comprising

a) culturing a suitable host cell according to any of claims
12-19 carrying a DNA sequence encoding a Pyrococcus α-amylase
or variant thereof having α-amylase activity in a suitable
culture medium under conditions permitting expression of the
Pyrococcus α-amylase or variant thereof, and recovering the
resulting α-amylase or variant thereof from the culture, and

b) subjecting an aqueous starch slurry to enzymatic liquefaction
in the presence of the α-amylase or variant thereof
recovered in step a).

31. Use of an amylolytic enzyme as defined in any of claims
21-24 for starch liquefaction and/or saccharification, debranching
of starch, production of syrups, production of cyclodextrin,
or production of oligosaccharides.